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DGATls 



DGAT2S 




AaDGATl-XP_001658299 (0.1526) 
DnCGATl-NP_609813. 1 (-0.0003) 
DrrCGATl-NP_995724.1 (0.0003) 
— TcDGATl-XP_975142.1 (0.1764) 
BbDGATl-AAZ22403. 1 (0.0095) 
BtDGATl-NP_777118.2 (0.0028) 
OaDGATl-NPJ01103634.1 (0.0055) 
ChDGATl-ABD59375.1 (-0.0105) 
SsDGATl-NP_999216.1 (0.0431) 
CfDGATl-XP_849176.1 (0.0129) 
CfDGATl-XP_858062.1 (0.0143) 
HsDGATl-NP_036211.2 (0.0138) 
MrrfX3ATl-XP_001090134. 1 (0.0190) 
PtDGATl-XP_520014.2 (0.0394) 
MmDGATl-NP_034176.1 (0.0221) 
RnDGATl-NP_445889.1 (0.0281) 
OCDGAT1-XPJ02724427.1 (0.1166) 
MdDGATl-XP_001371565.1 (0.1379) 
DrDGATl-NP_001002458.1 (0.1764) 
DrDGATl-NP_956024.1 (0.1690) 
SkDGATl-XP_002736160.1 (0.2322) 



DdDGATl-XP_645633.2 (0. 1977) 
PpDGATl-EFA85004. 1(0. 1922) 
AcDGAT2-EGC41804.1 (0.0762) 
PbDGATl-EEH17170.1 (0.0785) 
AODGAT2-EEQ31683.1 (0.1400) 
MaDGAT2-EFY86774.1 (0.0212) . 
MaDGAT2-EFY97444.1 (0.0146) 
XPJ02177753. 1 (0.3008) 
TgDGATl-AAP94209.1 (0.3776) 
rvDMT1-FFN50fi97.U0.7783) 



-SrtCGATl-XP_002964165.1 (0.1481) 

-OsDGATl-NP_001054869.2 (0.0857) 
-SbDGATl-XP_002439419.1 (0.0600) 
j— SbDGATl-XP_002437165.1 (0.0260) 
I — 7i 





-ZmDGATl-2-EU039830 (0.0312) 

VgDGATl-ABV21945.1 (0.1634) 

-EpDGATl-AC055635.1 (0,1026) 
-OeDGATl-AAS01606.1 (0.1110) 

— PfDGATl-AAG23696. 1 (0. 1351) 

— NtDGATl-AAF19345. 1 (0. 1378) 
— AtDGATl-NP_179535.1 (0.0627) 

BjDGATl-AAY40784. 1 (0.0070) 
BnDGATl-AAD45536.1 (0.0109) 
BjDGAT2-AAY40785. 1 (0.0236) 
BnDGAT2-AAD40881.1 (-0.0016) 

TmDGATl-AAM03340.2 (0.1550) 

EaDGATl-AAV31083.1 (0.1257) 
GrrfiGATl-AAS78662.1 (0.0141) 
GmDGATl-BAE93461.1 (0.0280) 
LjDGATl-AAW51456. 1 (0.0968) 

MtDGATl-ABN09107.1 (0.1207) 
WDGAT1-XP_002279345.1 (0.1356) 
JCDGAT1-ABB84383. 1 (0.0552) 
VfDGATl-DQ356680. 1 (0.0619) 
RcDGATl-XP_002514132.1 (0.1112) 
PtDGATl-XP_002308278.1 (0.0469) 
PtDGATl-XP_002330510.1 (0.0652) 



-AcDGAT2-XP_003225477.1 (0.1084) 
r BtDGAT2-DAA21853.1 (0.0160) 
MmDGAT2-NP_080660.1 (0.0057) 
RnDGAT2-NP_001012345.1 (0.0072) 
HSDGAT2-AAQ88896.1 (0.0036) 
L PaDGAT2-XP_002822304.1 (0.0067) 

XtDGAT2-NP_989372.1 (0.1120) 

DrDGAT2-NP_001025367.1 (0.1107) 

IpDGAT2-NP_001188005.1 (0.1136) 

BtDGAT2-XP_875499.3 (0.1480) 

rHsDGAT2-NP_835470.1 (0.0071) 

^PtDGAT2-XP_527842.2 (0.0076) 
-OaDGAT2-XP_001518899.1 (0.1482) 

DdDGAT2-XP_635762. 1 (0. 1687) 

PpDGAT2-EFA83646. 1 (0. 1778) 

-X-UrDGAT2A-AAK84179.1 (0.2462) 
--^UrDGAT2B-AAK84180.1 (0.2523) 

NvDGAT2-XP_001630435.1 (0.2402) 

NvDGAT2-XP_001633322.1 (0.2358) 

NvDGAT2-XP_001635548.1 (0.2307) 

— BtDGAT2-XP_002683800.1 (0.1914) 
— TgDGAT2-XP_002187643.1 (0.1857) 

MdDGAT2-XP_001365685.1 (0.2390) 

CiDGAT2-XP_002120879.1 (0.2661) 

iCeDGAT2-NP_505413,l (0.0018) 

'ceDGAT2-NP_872180.1 (-0.0018) 



ScDGAT2-NP_014888.1 (0.2326) 

— — 5pDGAT2-XP_001713160.1 (0.3047) 

^-TcDGAT2-XP_975146.1 (0.3157) 

CrDGAT2-XP_001694904.1 (0.3752) 

CrDGAT2-XP_001693189.1 (0.3347) 

-CVDGAT2-EFN51306.1 (0.2814) 
-OtDGAT2-XP_003083539.1 (0.3386) 



PpDGAT2-XP_001758758.1 (0.2100) 

PpDGAT2-XP_001777726.1 (0.1848) 

SmDGAT2-XP_002972054.1 (0.2082) 

— PsDGAT2-ABK26256.1 (0.1910) 

SpDGAT2-AAQ89590.1 (0.2749) 

-RcDGAT2-XP_002528531.1 (0.1468) 
VgDGAT2-ACV40232.1 (0.2818) 



— VTDGAT2-DG356682 (0. 1599) 
-EODGAT2-AC035365.1 (0.1434) 
-HvDGAT2-BAJ85730.1 (0.0955) 
-OsDGAT2-NP_001057530 (0.0877) 
-OsDGAT2-NP_001047917 (0.0781) 
-SbDGAT2-XP_002452652.1 (0.0194) 
-ZmDGAT2-NP_001150174.1 (0.0197) 

AtDGAT2-NP_566952 (0.1006) 

rBnDGAT2a-ACO90187 (0.0152) 

L BnDGAT2 b-ACO90188 (0.0132) 

— WDGAT2-XPJ02263626 (0. 1313) 

— EaDGAT2-ADF57328.1 (0.1509) 
GrrOGAT2-ACU20344.1 (0.1847) 

— MtDGAT2-ACJ84867.1 (0.1244) 

HaDGAT2-ABU50328.1 (0.1788) 

OeDGAT2-ADG22608.1 (0.1585) 

-PtDGAT2-XP 002317635.1 (0.1231) 



Structure-Function Analysis of Diacylglycerol 
Acyltransferase Sequences from 70 Organisms 



Cao 



O 



BiolVted Central Cao BM( ~ Research Notes 201 '< 4:249 

http://www.biomedcentral.eom/1756-0500/4/249 (21 July 2011) 



Cao BMC Research Notes 201 1, 4:249 
http://www.biomedcentral.eom/1756-0500/4/249 



Research Notes 



RESEARCH ARTICLE Open Access 



Structure-Function Analysis of Diacylglycerol 
Acyltransferase Sequences from 70 Organisms 

Heping Cao 



Abstract 

Background: Diacylglycerol acyltransferase families (DGATs) catalyze the final and rate-limiting step of 
triacylglycerol (TAG) biosynthesis in eukaryotic organisms. Understanding the roles of DGATs will help to create 
transgenic plants with value-added properties and provide clues for therapeutic intervention for obesity and 
related diseases. The objective of this analysis was to identify conserved sequence motifs and amino acid residues 
for better understanding of the structure-function relationship of these important enzymes. 

Results: 117 DGAT sequences from 70 organisms including plants, animals, fungi and human are obtained from 
database search using tung tree DGATs. Phylogenetic analysis separates these proteins into DGAT1 and DGAT2 
subfamilies. These DGATs are integral membrane proteins with more than 40% of the total amino acid residues 
being hydrophobic. They have similar properties and amino acid composition except that DGATIs are 
approximately 20 kDa larger than DGAT2s. DGATIs and DGAT2s have 41 and 16 completely conserved amino acid 
residues, respectively, although only two of them are shared by all DGATs. These residues are distributed in 7 and 
6 sequence blocks for DGATIs and DGAT2s, respectively, and located at the carboxyl termini, suggesting the 
location of the catalytic domains. These conserved sequence blocks do not contain the putative neutral lipid- 
binding domain, mitochondrial targeting signal, or ER retrieval motif. The importance of conserved residues has 
been demonstrated by site-directed and natural mutants. 

Conclusions: This study has identified conserved sequence motifs and amino acid residues in all 117 DGATs and 
the two subfamilies. None of the completely conserved residues in DGATIs and DGAT2s is present in recently 
reported isoforms in the multiple sequences alignment, raising an important question how proteins with 
completely different amino acid sequences could perform the same biochemical reaction. The sequence analysis 
should facilitate studying the structure-function relationship of DGATs with the ultimate goal to identify critical 
amino acid residues for engineering superb enzymes in metabolic engineering and selecting enzyme inhibitors in 
therapeutic application for obesity and related diseases. 



Background 

The complete genomes of many organisms including 
human, mouse, Arabidopsis and rice have been 
sequenced. The immediate challenge of post-genomic 
biology is to determine the biological functions of pro- 
teins encoded by unknown genes. Many endogenous 
proteins occur in extremely low abundance (such as the 
anti-inflammatory protein tristetraprolin/zinc-finger pro- 
tein 36, TTP/ZFP36) [1] and are labile (such as omega-3 
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fatty-acid desaturase, FAD3) [2], which complicates 
characterization of those proteins. 

One approach to gain clues about the structure-func- 
tion relationship of proteins is to perform comprehen- 
sive amino acid sequence analysis. It is generally 
accepted that critical amino acid residues and sequence 
motifs in the same family of proteins are evolutionarily 
conserved. We previously used a protein sequence ana- 
lysis approach to identify conserved sequence motifs 
and critical amino acid residues in several families of 
proteins from diverse organisms. The protein sequences 
we analyzed previously include the TTP/ZFP36 family 
involved in mRNA binding and destabilization [3,4], 
adenylate translocators [5], starch/glycogen synthases 
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[6], starch/glycogen branching enzymes [7,8], and 
starch/glycogen debranching enzymes [9]. 

Triacylglycerols (TAGs) are the major molecules of 
energy storage in eukaryotes. They also serve as a reser- 
voir of fatty acids for membrane biogenesis and lead to 
obesity due to excessive accumulation in adipose tissues. 
Diacylglycerol acyltransferase families (DGATs) are inte- 
gral microsomal membrane proteins that catalyze the 
last and rate-limiting step of TAG biosynthesis in eukar- 
yotic organisms. DGATs esterify s«-l,2-diacylglycerol 
with a long-chain fatty acyl-CoA. DGAT genes have 
been isolated from many organisms. At least two forms 
of DGATs are present in mammals [10,11] and plants 
[12,13] with additional forms reported in burning bush 
{Euonymus alatus) [14], peanut [15] and Arabidopsis 
[16]. Plants and animals deficient in DGATs accumulate 
less TAG [17-19]. Animals with reduced DGAT activity 
are resistant to diet-induced obesity [18,20] and lack 
milk production [18]. Over-expression of DGAT 
enzymes increases TAG content in plants [14,21-26], 
animals [27-30] and yeast [31]. DGATs have nonredun- 
dant functions in TAG biosynthesis in species such as 
mice [19] and tung tree (Vernicia fordii) [13]. Mice defi- 
cient in DGAT1 are viable, have modest decreases in 
TAG, and are resistant to diet-induced obesity [18,32]. 
In contrast, mice deficient in DGAT2 have severe reduc- 
tion of TAG and die shortly after birth [19]. The fact 
that DGAT1 is unable to compensate for the deficiency 
in DGAT2 knockout mice indicates the nonredundant 
functions of each DGAT isoform in TAG biosynthesis 
during mammal development. Therefore, understanding 
the roles of DGATs in plants and animals will have tre- 
mendous potential in creating new oilseed crops with 
value-added properties and providing information for 
therapeutic intervention for obesity and related diseases. 

Limited numbers of DGAT amino acid sequences 
were analyzed previously [13,33]. However, there is a 
lack of comprehensive analysis of amino acid sequences 
of DGATs among diverse organisms. The objective of 
this analysis was to identify conserved sequence motifs 
and amino acid residues in 117 DGATs from 70 organ- 
isms to provide a better understanding of the structure- 
function relationship of these important enzymes. 

Results 

Phylogenetic analysis and classification of DGATs 

A database search using tung tree {Vernicia fordii) 
DGAT1 and DGAT2 protein sequences [13] has identi- 
fied 117 DGAT sequences from 70 organisms including 
plants, animals, fungi and human (Table 1). More than 
two forms of DGATs are present in a number of spe- 
cies. For example, Bos taurus (cow) and Brassica napus 
(rape) have four forms of DGATs, whereas Homo 
sapiens (human) and Danio rerio (zebrafish) have three 



forms of DGATs (Table 1). Phylogenetic analysis indi- 
cates that all 117 DGAT protein sequences are grouped 
in the same phylogenetic tree (data not shown) and 
clearly separated into DGAT1 and DGAT2 subfamilies 
(Figure 1). DGATls are more conserved than DGAT2s. 
DGATs from plants, animals and fungi are also dis- 
tinctly separated from each other with a few exceptions 
(Figure 1). 

Six DGATs previously classified as DGAT2s in the 
GenBank databases [GenBank:EGC41804.1, GenBank: 
EEQ31683.1, GenBank:AAY40785.1, GenBank: 
AAD40881.1, GenBank:EFY86774.1 and GenBank: 
EFY97444.1] are clustered within the DGAT1 subfamily 
in this phylogenetic analysis (Figure 1). More detailed 
sequence analysis within the multiple sequence align- 
ment also indicates that their sequences are more simi- 
lar to DGATls (see below). Therefore, these six 
DGAT2s are reclassified as DGATls (AcDGATl- 
EGC41804.1, AoDGATl-EEQ31683.1, BjDGATlb- 
AAY40785.1, BnDGATlb-AAD40881.1, MaDGATla- 
EFY86774.1, and MaDGATlb-EFY97444.1, respectively) 
(Table 1). 

DGAT properties and amino acid composition 

It is generally observed that DGATls are much bigger 
proteins with more transmembrane domains than 
DGAT2s. The properties and amino acid composition of 
109 full-length DGATs were analyzed and calculated 
according to the classifications of DGAT1 and DGAT2 
subfamilies by phylogenetic analysis. The eight partial 
sequences listed in Table 1 were excluded from this 
analysis (AaDGATl-XP_001658299, BnDGATlb- 
AAD40881.1, BtDGAT2c-XP_002683800.1, ChDGATl- 
ABD59375.1, CvDGAT2-EFN51306.1, OaDGAT2- 
XP_001518899.1, PtDGATl-XP_002177753.1 and 
PtDGAT2-XP_002317635.1). 

DGATls have an average of 515 amino acid residues 
with a standard deviation of 44 amino acid residues 
among the 55 sequences (Table 2). The average residue 
number of DGAT1 is 171 amino acids greater than the 
average DGAT2 length, which has an average of 344 
residues with a standard deviation of 29 residues among 
the 54 sequences (Table 2). This corresponds to 
approximately 20 kDa difference in the molecular mass. 
The isoelectric points of DGATls and DGAT2s are 
similar with an average value of 9.17 and 9.28, respec- 
tively, however DGATls have approximately 4 more 
charges at pH 7 than DGAT2s (Table 2). 

The frequency of functional amino acid residue groups 
between DGAT1 and DGAT2 subfamilies is remarkably 
similar (Table 2). DGATls and DGAT2s have charged 
residues (RKHYCDE) of approximately 26%, which 
includes approximately 7% of acidic residues (DE) and 
10% of basic residues (KR). The frequency of polar 
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Table 1 DGATs sequence information 



No. DGAT name Organism 



DGAT GenBank* 



No. DGAT name 



Organism 



DGAT GenBank* 



1 AaDGATI - 
XP_001 658299 

2 AcDGATI- 
EGC41 804.1 ** 

3 AcDGAT2- 
XP_003225477.' 

4 AgDGAT2- 
NP_983542.1 

5 AoDGATI - 
EEQ31 683.1 ** 

6 AtDGATI - 
NP_1 79535.1 

7 AtDGAT2- 
NP_566952 

8 BbDGATI- 
AAZ22403.1 

9 BjDGATI a- 
AAY40784.1 

10 BjDGATI b- 
AAY40785.1 ** 

11 BnDGATIa- 
AAD45536.1 

12 BnDGATIb- 
AAD40881.1 *, 5 

13 BnDGAT2a- 
ACO90187 

14 BnDGAT2b- 
ACO90188 

15 BtDGATI - 
NP_7771 18.2 

16 BtDGAT2a- 
DAA21853.1 

1 7 BtDGAT2b- 
XP_875499.3 

18 BtDGAT2c- 
XP_002683800.1 

19 CeDGAT2a- 
NP_505413.1 

20 CeDGAT2b- 
NP_872180.1 

21 CfDGATI b- 
XP_849176.1 

22 CfDGATI c- 
XP_858062.1 

23 ChDGATI- 
ABD59375.1 * 

24 QDGAT2- 
XP_0021 20879.1 

25 CrDGAT2a- 
XP_001 694904.1 

26 CrDGAT2b- 
XP_001693189.1 

27 CvDGATI - 
EFN50697.1 

28 CvDGAT2- 
EFN5 1306.1 * 



Aedes aegypti (A) *** 

Ajellomyces capsulatus 
(F) 

Anolis carolinensis (A) 

Ashbya gossypii (F) 

Arthroderma otae (F) 

Arabidopsis thaliana 
(P) 

Arabidopsis thaliana 
(P) 

Bubalus bubalis (A, 
buffalo) 

Brassica juncea (P) 
Brassica juncea (P) 
Brassica napus (P) 
Brassica napus (P) 
Brassica napus (P) 
Brassica napus (P) 



1 XP_001 658299 

1 EGC4 1804.1 

2 XP_003225477.1 
2 NP_983542.1 

1 EEQ3 1683.1 

1 NP_1 79535.1 

2 NP_566952 

1 AAZ22403.1 
la AAY40784.1 
1b AAY40785.1 
la AAD45536.1 
1b AAD40881.1 

2 ACO90187 
2 ACO90188 



Bos taurus (A, cattle) 1 

Bos taurus (A, cattle) 2a 

Bos taurus (A, cattle) 2b 

Bos taurus (A, cattle) 2c 



Caenorhabditis elegans 2a 
(A, worm) 

Caenorhabditis elegans 2b 
(A, worm) 

Canis familiaris 1b 
(A, dog) 

Canis familiaris 1c 
(A, dog) 

Capra hircus 1 
(A, sheep) 

Ciona intestinalis (A) 2 



Chlamydomonas 
reinhardtii (Algae) 

Chlamydomonas 
reinhardtii (Algae) 

Chlorella variabilis 
(Algae) 

Chlorella variabilis 
(Algae) 



2a 
2b 



60 NtDGATI - 
AAF1 9345.1 

61 NvDGAT2a- 
XP_001 630435.1 

62 NvDGAT2b- 
XP_001 633322.1 

63 NvDGAT2c- 
XP_001 635548.1 

64 OaDGATI- 
NP_001 103634.1 

65 OaDGAT2- 
XP_001518899.1 * 

66 OcDGATI- 
XP_002724427.1 

67 OeDGATI - 
AAS0 1606.1 

68 OeDGAT2- 
ADG22608.1 

69 OsDGATI - 
NP_001 054869.2 

70 OsDGAT2a- 
NP_001047917 

71 OsDGAT2b- 
NP_001 057530 

72 OtDGAT2- 
XP_003083539.1 

73 PaDGAT2- 
XP_002822304.1 



NP_7771 18.2 74 

DAA21 853.1 75 

XP_875499.3 76 

XP_002683800.1 77 

NP_505413.1 78 

NP_872 180.1 79 

XP_849176.1 80 

XP_858062.1 81 

ABD59375.1 82 

XP_002120879.1 83 

XP_001694904.1 84 

XP_001693189.1 85 

EFN50697.1 86 

EFN51 306.1 87 



PbDGATI- 
EEH17170.1 

PfDGATI - 
AAG23696.1 

PpDGATI- 
EFA85004.1 

PpDGAT2- 
EFA83646.1 

PpDGATI- 
XP_001 770929.1 

PpDGAT2a- 
XP_001 758758.1 
PpDGAT2b- 
XP_001 777726.1 

PsDGAT2- 
ABK26256.1 

PtDGATI - 
XP_520014.2 

PtDGAT2- 
XP_527842.2 

PtDGATI - 
XP_0021 77753.1 

PtDGATI a- 
XP_002308278.1 
PtDGATI b- 
XP_002330510.1 

PtDGAT2- 

XP 002317635.1 



Nicotiana tabacum 
(P, tobacco) 

Nematostella vectensis 
(A, worm) 

Nematostella vectensis 
(A, worm) 

Nematostella vectensis 
(A, worm) 

Ovis aries (A, sheep) 

Ovis aries (A, sheep) 

Oryctolagus cuniculus 
(A, rabbit) 

Olea europaea (P, tree) 

Olea europaea (P, tree) 

Oryza sativa (P, rice) 

Oryza sativa (P, rice) 

Oryza sativa (P, rice) 

Ostreococcus tauri 
(Algae) 

Pongo abelii 
(A, Sumatran 
orangutan) 

Paracoccidioides 
brasiliensis (F) 

Perilla frutescens (P) 

Polysphondylium 
pallidum (F, mold) 

Polysphondylium 
pallidum (F, mold) 



Physcomitrella patens 1 
(P, moss) 

Physcomitrella patens 1 
(P, moss) 

Physcomitrella patens 2b 
(P, moss) 

Picea sitchensis (P, tree) 2 

Pan troglodytes 1 
(A, chimpanzee) 

Pan troglodytes 2 
(A, chimpanzee) 

Phaeodactylum 1 
tricornutum (F) 

Populus trichocarpa 1a 
(P, tree) 

Populus trichocarpa 1 b 
(P, tree) 

Populus trichocarpa (P, 2 
tree) 



1 AAF1 9345.1 

2a XP_001 630435.1 

2b XP_001 633322.1 

2c XP_001 635548.1 

1 NP_00 1103634.1 

2 XP_001 5 18899.1 
1 XP_002724427.1 

1 AAS0 1606.1 

2 ADG22608.1 

1 NP_00 1054869.2 
2a NP_001047917 
2b NP_001 057530 

2 XP_003083539.1 
2 XP_002822304.1 



I 



EEH17170.1 
AAG23696.1 
EFA85004.1 
EFA83646.1 

XP_00 1770929.1 
XP_001 758758.1 
XP_001 777726.1 
ABK26256.1 
XP_520014.2 
XP_527842.2 
XP_0021 77753.1 
XP_002308278.1 
XP_002330510.1 
XP_0023 17635.1 
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Table 1 DGATs sequence information (Continued) 



29 DdDGATI- 
XP_645633.2 

30 DdDGAT2- 
XP_635762.1 

31 DmDGATIa- 
NP_609813.1 

32 DmDGATId- 
NP_995724.1 

33 DrDGATI a- 
NP_956024.1 

34 DrDGATI b- 
NP_001 002458.1 

35 DrDGAT2- 
NP_001025367.1 

36 EaDGATI- 
AAV3 1083.1 

37 EaDGAT2- 
ADF57328.1 

38 EoDGAT2- 
AC035365.1 

39 EpDGATI - 
AC055635.1 

40 GmDGATIa- 
AAS78662.1 

41 GmDGATI b- 
BAE93461.1 

42 GmDGAT2- 
ACU20344.1 

43 HaDGAT2- 
ABU50328.1 

44 HsDGATI- 
NP_036211.2 

45 HsDGAT2a- 
AAQ88896.1 

46 HsDGAT2b- 
NP_835470.1 

47 HvDGAT2- 
BAJ85730.1 

48 lpDGAT2b- 
NP_001 188005.1 

49 JcDGATI- 
ABB84383.1 

50 LjDGATI- 
AAW5 1456.1 

51 MaDGATIa- 
EFY86774.1 ** 

52 MaDGATIb- 
EFY97444.1 ** 



DictyosteHum 1 
discoideum (mold) 

Dictyostelium 2 
discoideum (mold) 

Drosophila 1 a 

melanogaster (A, fly) 

Drosophila 1 c 

melanogaster (A, fly) 

Danio rerio 1 a 
(A, zebrafish) 

Danio rerio 1 1 
(A, zebrafish) 

Danio rerio 2 
(A, zebrafish) 

Euonymus alatus (P) 1 

Euonymus alatus (P) 2 

Elaeis oleifera (P) 2 

Echium pitardii (P) 1 



Glycine max 
(P, soybean) 

Glycine max 
(P, soybean) 

Glycine max 
(P, soybean) 

Helianthus annuus (P) 



Homo sapiens 
(human) 

Homo sapiens 
(human) 

Homo sapiens 
(human) 

Hordeum vulgare 
(P, barley) 

Ictalurus punctatus 
(A, catfish) 

Jatropha curcas (P) 
Lotus japonicas (P) 



Metarhizium acridum 1a 
(F) 

Metarhizium anisopliae 1 b 
(F) 



2 
2 
1 

2a 
2b 
2 

2 b 



53 


MdDGATI- 


Monodelphis 




XP_001 371 565.1 


domestica 






(A, opossum) 


54 


MdDGAT2- 


Monodelphis 




XP_001 365685.1 


domestica 






(A, opossum) 


55 


MmDGATI- 


Mus musculus 




NP_034176.1 


(A, mouse) 


56 


MmDGAT2- 


Mus musculus 




NP_080660.1 


(A, mouse) 


57 


MmDGATI- 


Macaca mulatto 




XP_001 0901 34.1 


(A, monkey) 



XP_645633.2 

XP_635762.1 

NP_609813.1 

NP_995724.1 

NP_956024.1 

NP_001002458.1 

NP_001 025367.1 

AAV3 1083.1 

ADF57328.1 

AC035365.1 

AC055635.1 

AAS78662.1 

BAE93461.1 

ACU20344.1 

ABU50328.1 

NP 036211.2 



NP_835470.1 
BAJ85730.1 
NP_001 188005/ 
ABB84383.1 
AAW5 1456.1 
EFY86774.1 
EFY97444.1 
XP_001 371 565.1 

XP_001 365685.1 

NPJB4176.1 
NP_080660.1 
XP 001090134.1 



88 RcDGATI - 
XP_002514132.1 

89 RcDGAT2- 
XP_002528531.1 

90 RnDGATI- 
NP_445889.1 

91 RnDGAT2- 
NP_001012345.' 

92 SbDGATIa- 
XP_002437 165.1 

93 SbDGATI b- 
XP_002439419.1 

94 SbDGAT2- 
XP_002452652.1 

95 ScDGAT2- 
NP_0 14888.1 

96 SkDGATI- 
XP_002736160.1 

97 SmDGATl- 
XP_002964165.1 

98 SmDGAT2- 
XP_002972054.1 

99 SpDGAT2- 
AAQ89590.1 

100 SpDGAT2- 
XP_001713160.1 

101 SsDGATI - 
NP_999216.1 

102 TcDGATI - 
XP_975142.1 

103 TcDGAT2- 
XP_975146.1 

104 TgDGATI- 
AAP94209.1 

105 TgDGAT2- 
XP_002187643.1 

106 TmDGATI- 
AAM03340.2 

107 U rDG AT2 A- 

AAK84179.1 

108 UrDGAT2B- 
AAK84180.1 

109 VfDGATI - 
DQ356680.1 

110 VfDGAT2- 
DQ356682.1 

111 VgDGATI- 
ABV2 1945.1 

112 VgDGAT2- 
ACV40232.1 

113 WDGAT1- 
XP_002279345.1 

114 WDGAT2- 
XP_002263626 

115 XtDGAT2- 
NP_989372.1 

116 ZmDGATIb- 
EU039830 



Ricinus communis 1 
(P, castor bean) 

Ricinus communis 1 
(P, castor bean) 

Rattus norvegicus 1 
(A, rat) 

Rattus norvegicus 2 
(A, rat) 

Sorghum bicolor 1a 
(P, sorghum) 

Sorghum bicolor 1 b 

(P, sorghum) 

Sorghum bicolor 2 
(P, sorghum) 

Saccharomyces 2 
cerevisiae (F, yeast) 

Saccoglossus 1 
kowalevskii (A, worm) 

Selaginella 1 
moellendorffii (P) 

Selaginella 2 
moellendorffii (P) 

Spirodela polyrhiza (P) 2 

Schizosaccharomyces 2 
pombe (F, yeast) 

Sus scrofa (A, pig) 1 

Tribolium castaneum 1 
(A) 

Tribolium castaneum 2 
(A) 

Toxoplasma gondii 1 
(A, worm) 

Taeniopygia guttata 2 
(A, bird) 

Tropaeolum majus (P) 1 



Umbelopsis 
ramanniana (F) 



Umbelopsis 2 
ramanniana (F) 

Vernicia fordii (P, tung 1 
tree) 

Vernicia fordii (P, tung 2 
tree) 

Vernonia galamensis 1 
(P) 

Vernonia galamensis 2 
(P) 



Vitis vinifera (P, grape) 1 

Vitis vinifera (P, grape) 2 

Xenopus tropicalis 2 
(A, frog) 

lea mays (P, corn) lb 



XP_0025 141 32.1 

XP_002528531.1 

NP_445889.1 

NP_001 01 2345.1 

XP_002437165.1 

XP_002439419.1 

XP_002452652.1 

NP_0 14888.1 

XP_002736160.1 

XP_002964165.1 

XP_002972054.1 

AAQ89590.1 

XP_001 713160.1 

NP_999216.1 

XP_975142.1 

XP_975146.1 

AAP94209.1 

XP_002 187643.1 

AAM03340.2 

AAK84 179.1 

AAK84180.1 

DQ356680.1 

DQ356682 

ABV21945.1 

ACV40232.1 

XP_002279345.1 

XP_002263626 

NP_989372.1 

EU039830 
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Table 1 DGATs sequence information (Continued) 

58 MtDGATI- Medicago truncatula 1 ABN09107.1 117 ZmDGAT2- lea mays (P, com) 2 NP_001 150174.1 
ABN09107.1 (P) NP_001 1501 74.1 

59 MtDGAT2- Medicago truncatula 2 ACJ84867.1 
ACJ84867.1 (P) 

*8 partial sequences were not used for statistics. **6 DGATs named as DGAT2s in the GenBank Databases should be DGATIs as indicated in the table because 
they are more similar to DGATIs. ***A: animal, F: fungus, P: plant. 



resides (NCQSTY) is also similar with an average of 25% 
vs. 22% in polar residues for DGATIs and DGAT2s, 
respectively. DGATIs and DGAT2s are integral mem- 
brane proteins with 42% and 41% of the total residues 
being hydrophobic (AILFWV), respectively (Table 2). 

Identification of amino acid residues and sequence motifs 
conserved among all DGATs 

The sequences between DGATIs and DGAT2s are very 
divergent. No common features between these two subfa- 
milies were reported previously using 7 DGATIs and 8 
DGAT2s from 10 organisms [13]. However, the biochem- 
ical function of the two DGAT isoforms is essentially 
identical in enzymatic assays, suggesting that certain 
common sequence conservations are probably present 
between them. Multiple sequence alignment was per- 
formed to analyze all 117 DGATs including 59 DGATIs 
from 48 organisms and 58 DGAT2s from 44 organisms. 
Sequence alignment has identified only two completely 
conserved amino acid residues among all DGATs. One 
perfectly conserved proline residue corresponds to P248 
in AaDGATl-XP_001658299 and P151 in VTDGAT2- 
DQ356682.1 (Figure 2A); the other residue, a perfectly 
conserved phenylalanine residue corresponds to F344 in 
AaDGATl-XP_001658299 and F225 in VfDGAT2- 
DQ356682.1 (Figure 2B). The conserved phenylalanine 
residue is followed by a glycine residue conserved in all 
except DGATIs of Aedes aegypti, Drosophila melanoga- 
ster and Tribolium castaneum (Figure 2B). The other 
proline residue in Figure 2C is almost completely con- 
served except DGAT2 of Ostreococcus tawci, which corre- 
sponds to P411 in AaDGATl-XP_001658299 and P276 
in VfDGAT2-DQ356682.1. Based on the sequence con- 
servation patterns with these conserved residues as 
anchors, the conserved sequence motifs among all 
DGATs are named as Motif 1 (P Block), Motif 2 (FG 
Block) and Motif 3 (P-l Block). The highly conserved 
residues among diverse organisms may be located at the 
active sites of the enzymes and play important roles in 
structure, substrate binding and/or catalysis. 

Identification of amino acid residues and sequence motifs 
conserved in DGATIs 

Multiple sequence alignment was performed to identify 
conserved amino acid residues and sequence motifs 
within the DGAT1 subfamily. Among the 55 full-length 



DGATIs, 41 amino acid residues are completely con- 
served, which correspond to 8.0% of the total 515 resi- 
dues of DGATIs. Table 3 shows the positions of the 41 
completely conserved residues in DGATIs from repre- 
sentatives of animal group (mouse), plant group (tung 
tree) and fungus group (Dictyostelium discoideum) . 
These completely conserved residues are located in 
seven sequence motifs of DGATIs. Based on the 
sequence conservation patterns with the completely 
conserved residues as anchors, the conserved sequence 
motifs of DGATIs are named as Motif 1 (GL Block), 
Motif 2 (KSR Block), Motif 3 (PTR Block) (Figure 3A- 
C), Motif 4 (QP Block), Motif 5 (LWLFFEFDR- 
FYWWNWWNPPFSHP Block) (Figure 4A-B), Motif 6 
(FQL Block) and Motif 7 (NGQPY Block) (Figure 5A-B). 
The great majority of the conserved residues are located 
within the carboxyl terminal halves of DGATIs. Among 
the completely conserved residues, 33 of them are 
located within the last 200 residues from the carboxyl 
termini in Motifs 4-7 (Figures 4, 5), and 23 of them are 
concentrated in the most conserved region with 
approximately 100 residues in Motif 6 (Figure 4B). The 
first two conserved residues (G, L) in Motif 1 start 
approximately 100 residues from the amino termini 
(Figure 3A). The next three conserved residues (K, S, R) 
in Motif 2 start until approximately 200 residues from 
the amino termini (Figure 3B). The last conserved resi- 
due (Y) in Motif 7 ends within the last 20 residues from 
the carboxyl termini of DGATIs (Figure 5B). 

Identification of amino acid residues and sequence motifs 
conserved in DGAT2s 

Multiple sequence alignment was also performed using all 
DGAT2s to identify conserved amino acid residues and 
sequence motifs within this subfamily. Sixteen residues are 
completely conserved in the 54 full-length DGAT2s, cor- 
responding to 4.7% of the total 344 residues. Table 3 
shows the positions of the 16 completely conserved resi- 
dues in DGAT2s from representatives of animal group 
(mouse), plant group (tung tree) and fungus group (Dic- 
tyostelium discoideum). Based on the sequence conserva- 
tion patterns with the completely conserved residues as 
anchors, the conserved sequence motifs of DGAT2s are 
named as Motif 1 (PH Block), Motif 2 (PR Block), Motif 3 
(GGE Block) (Figure 6A-C), Motif 4 (RGFA Block), Motif 
5 (VPFG Block) and Motif 6 (G Block) (Figure 7A-C). 



Cao BMC Research Notes 201 1, 4:249 
http://www.biomedcentral.eom/1756-0500/4/249 



Page 6 of 23 



B 




— AaDGATl-XP_001658299 (0.1526) 
_,DmDGATl-NP_609813.1 (-0.0003) 
~~ 'DmDGATl-NP_995724.1 (0.0003) 
— TcDGATl-XP_975142.1 (0.1764) 
■BbDGATl-AAZ22403.1 (0.0095) 
BtDGATl-NP_777118.2 (0.0028) 
OaDGATl-NP_001103634.1 (0.0055) 
J ChDGATl-ABD59375.1 (-0.0105) 
-SsDGATl-NP_999216.1 (0.0431) 

| rCfDGATl-XP_849176.1 (0.0129) 

LcfDGATl-XP_858062.1 (0.0143) 
HsDGATl-NP_036211.2 (0.0138) 
MmDGATl-XP_001090134.1 (0.0190) 
PtDGATl-XP_520014.2 (0.0394) 
MrrfiGATl-NP_034176.1 (0.0221) 
RnDGATl-NP_445889.1 (0.0281) 
OcDGATl-XP_002724427.1 (0.1166) 
MdDGATl-XP_001371565.1 (0.1379) 
DrDGATl-NP_001002458.1 (0.1764) 
DrDGATl-NP_956024.1 (0.1690) 
SkDGATl-XP_002736160.1 (0.2322) 



— DdDGATl-XP_645633.2 (0.1977) 
-PpDGATl-EFA85004.1 (0.1922) 

AcDGAT2-EGC41804.1 (0.0762) 

PbDGATl-EEH17170.1 (0.0785) 

AoDGAT2-EEQ31683.1 (0.1400) 

_r-MaDGAT2-EFY86774.1 (0.0212) . 
~l-MaDGAT2-EFY97444.1 (0.0146) 
iATl-XP_002177753, 1 (0.3008) 
TgDGATl-AAP94209.1 (0.3776) 



-PtDGAT 



-CVDGAT1-EFN50697. 1 (0.2783) 



PpDGATl-XP_001770929.1 (0.1794) 

SmDGATl-XP_002964165.1 (0.1481) 

OsDGATl-NP_001054869.2 (0.0857) 



-SbDGATl-XP_002439419.1 (0.060 

|—SbDGATl-XP_002437165.1 (0.0260) 

I — ZmDGATl-2-EU039830 (0.0312) 

VgDGATl-ABV21945.1 (0.1634) 

-EpDGATl-AC055635.1 (0.1026) 
-OeDGATl-AAS01606.1 (0,1110) 

— PfDGATl-AAG23696.1 (0.1351) 

— NtDGATl-AAF19345.1 (0.1378) 
— AtDGATl-NP_179535.1 (0.0627) 

BjDGATl-AAY40784. 1 (0.0070) 
BnDGATl-AAD45536.1 (0.0109) 
BjDGAT2-AAY40785.1 (0.0236) 
BnDGAT2-AAD40881.1 (-0.0016) 
TmDGATl-AAM03340.2 (0.1550) 

— EaDGATl-AAV31083.1 (0.1257) 
_r-GrrOGATl-AAS78662.1 (0.0141) 
~1— GmDGATl-BAE93461.1 (0.0280) 

LjDGATl-AAW51456.1 (0.0968) 

MtDGATl-ABN09107.1 (0.1207) 

WDGAT1-XP_002279345.1 (0.1356) 

JcDGATl-ABB84383.1 (0.0552) 
VfDGATl-DQ356680.1 (0.0619) 
RcDGATl-XP_002514132.1 (0.1112) 
-PtDGATl-XP_002308278.1 (0.0469) 

— PtDGATl-XP_002330510.1 (0.0652) 



AcDGAT2-XP_003225477.1 (0.1084) 
BtDGAT2-DAA21853.1 (0.0160) 
MrrDGAT2-NP_080660.1 (0.0057) 
RnDGAT2-NP_001012345.1 (0.0072) 
HSDGAT2-AAQ88896.1 (0.0036) 
L PaDGAT2-XP_002822304.1 (0.0067) 

XtDGAT2-NP_989372.1 (0.1120) 

DrDGAT2-NP_001025367.1 (0.1107) 

IpDGAT2-NP_001188005.1 (0.1136) 

BtDGAT2-XP_875499.3 (0.1480) 

rHsDGAT2-NP_835470.1 (0.0071) 

I PtDGAT2-XP_527842. 2 (0. 0076) 
-OaDGAT2-XP_001518899.1 (0.1482) 

DdDGAT2-XP_635762.1 (0.1687) 

-PpDGAT2-EFA83646.1 (0.1778) 
-UrDGAT2A-AAK84179.1 (0.2462) 
UrDGAT2B-AAK84180.1 (0.2523) 
-NvDGAT2-XP_001630435.1 (0.2402) 
-NvDGAT2-XP_001633322.1 (0.2358) 
-NvDGAT2-XP_001635548.1 (0.2307) 
-BtDGAT2-XP_002683800.1 (0.1914) 
-TgDGAT2-XP_002187643.1 (0.1857) 
-MdDGAT2-XP_001365685.1 (0.2390) 
— CiDGAT2-XP_002120879.1 (0.2661) 

iCeDGAT2-NP_505413.1 (0.0018) 

_ ICeDGAT2-NP_872180.1 (-0.0018) 



AgDrJA I 2-NM_yijjb42.UU.2iij>) — 

ScDGAT2-NP_014888.1 (0.2326) 

— SpDGAT2-XP_001713160.1 (0.3047) 

^^TcDGAT2-XP_975146.1 (0.3157) 

CrDGAT2-XP_001694904.1 (0.3752) 

CrDGAT2-XP_001693189.1 (0.3347) 

-CvDGAT2-EFN51306.1 (0.2814) 

-OtDGAT2-XP_003083539.1 (0.3386) 



PpDGAT2-XP_001758758.1 (0.2100) 

PpDGAT2-XP_001777726.1 (0.1848) 

SmDGAT2-XP_002972054. 1 (0.2082) 

-PSDGAT2-ABK26256.1 (0.1910) 
SpDGAT2-AAQ89590.1 (0.2749) 



-RcDGAT2-XP_002528531.1 (0.1468) 
VgDGAT2-ACV40232.1 (0.2818) 



-VfDGAT2-DG356682 (0.1599) 
EODGAT2-AC035365. 1 (0. 1434) 
HvDGAT2-BAJ85730.1 (0.0955) 
OsDGAT2-NP_001057530 (0.0877) 
OsDGAT2-NP_001047917 (0.0781) 
SbDGAT2-XP_002452652.1 (0.0194) 
ZmDGAT2-NP_001150174.1 (0.0197) 
-AtDGAT2-NP_566952 (0.1006) 
_rBnDGAT2a-ACO90187 (0.0152) 




L BnDGAT2b-ACO90188 (0.0132) 
-VvDGAT2-XP_002263626 (0.1313) 

— EaDGAT2-ADF57328.1 (0.1509) 
GrrDGAT2-ACU20344.1 (0.1847) 

— MtDGAT2-ACJ 84867.1 (0.1244) 

Ha DGAT2-ABU50328.1 (0.1788) 

OeDGAT2-ADG22608.1 (0.1585) 

-PtDGAT2-XP_002317635.1 (0.1231) 



Figure 1 Phylogenetic analysis of DGATIs and DGAT2s. The presumed evolutionary relationships among the 1 17 DGATs from 70 organisms 
(listed in Table 1) were analyzed by phylogenetic analysis based on the Neighbor-Joining method of Saitou and Nei. The numbers in the 
parenthesis following DGAT names are the calculated distance values which reflect the degree of divergence between all pairs of DGAT 
sequences analyzed. The sequences above the red line are from animals, whereas the sequences below the green line are from plants. A red 
"star" before the sequence indicates the exceptional sequence from the grouping. 
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Table 2 DGATs properties and amino acid composition 
(% by frequency) 



DGAT1 (n = 55) DGAT2 (n = 54) 



Length (aa) 


515 


± 44 


344 ± 29 


Molecular weight (Da) 


58796 ± 4871 


38920 ± 3330 


Isoelectric point (PI) 


9.17 


± 0.46 


9.28 ± 0.42 


Charge at pH 7 


15.11 


± 5.98 


1 1.34 ± 4.25 


Charged (RKHYCDE) (%) 


26.03 


± 1.27 


26.35 ± 2.02 


Acidic (DE) (%) 


7.20 


± 0.90 


7.38 ± 0.87 


Basic (KR) (%) 


9.97 


± 0.78 


10.47 ± 1.15 


Polar (NCQSTY) (%) 


24.59 


± 2.49 


22.25 ± 2.93 


Hydrophobic (AILFWV) (%) 


42.18 


± 2.25 


40.92 ± 2.70 



Similar to DGATls, these conserved residues are located 
at the carboxyl termini of DGAT2s. Eight of them are con- 
centrated within 25 residues in the highly conserved 
Motifs 4 and 5 of DGAT2s (Figure 7A-B). The first two 
conserved residues (PH) in Motif 1 start until approxi- 
mately 100 residues from the amino termini but the last 
residue (G) ends within the last 50 residues from the car- 
boxyl termini of DGATls (Figure 6A and 7C). 

Sequence analysis of important motifs in less 
conservative regions of DGATs 

Several studies have reported functional motifs in 
DGATs. However, the conserved sequence motifs identi- 
fied by our extensive sequence analysis as discussed 
above do not contain any of the reported putative neutral 
lipid-binding domain [34], mitochondrial targeting signal 
[35] or ER retrieval motif [13]. It was reported that 
mouse DGAT2 contains a consensus sequence 
(FLXLXXX") for a putative neutral lipid-binding domain 
[34] which was shown to be present in proteins that 
either bind to or metabolize neutral lipids [36] . However, 
this putative motif is only modestly conserved in animal 
DGAT2s and not present in any plant DGAT2 (Figure 
8A). Mouse DGAT2 was also reported to contain a puta- 
tive mitochondrial targeting signal with positively 
charged residues (RXKXXK) targeting proteins to mito- 
chondria [35]. This motif is only found in a few animal 
DGATs but not conserved in any of the plant or fungi 
DGAT2s (Figure 8B). DGATs are ER-localized enzymes 
with an ER retrieval motif (LKLEI) at the extreme car- 
boxyl terminus of tung DGAT2 [13]. This sequence ana- 
lysis shows that this pentapeptide ER-retrieval motif is 
only modestly conserved in plant DGAT2s but not in 
animal or fungal DGAT2s, although animal and fugal 
DGATs are also located to ER (Figure 8C). 

Sequence analysis of important amino acid residues in 
less conservative regions of DGATs 

The importance of some less conserved residues of 
DGATs has been demonstrated by site-directed mutants 



(Table 4). Mutagenesis of a putative SnRKl target site 
S197 in Tropaeolum majus DGAT1 results in a 38%- 
80% increase in DGAT1 activity, and over-expression of 
the mutated TmDGATl in Arabidopsis results in a 
20%-50% increase in oil content on a per seed basis 
(Table 4) [25]. Figure 9A shows that this serine residue 
is conserved in most of the plants and some animals 
except that the same position in DGATls alignment is 
replaced with proline, glycine, threonine and lysine resi- 
due in DGATls from other organisms. A similar serine 
residue is not found in any of DGAT2s. Mutagenesis at 
P216 in Tropaeolum majus DGAT1 eliminates almost 
all of the activity (Table 4) [25]. The P216 residue is 
completely conserved in plant DGATls but is missing 
in mammalian DGATls (Figure 9B). 

A highly conserved region with a consensus sequence 
of "YFP" in DGAT2s is essential for enzymatic activity 
of DGAT2 from Saccharomyces cerevisiae (Table 4) 
[33]. These three residues are highly conserved and 
located before Motif 1 but none of the three amino acid 
residues is completely conserved among all DGAT2s in 
our analysis using 54 full-length DGAT2s (Figure 10A). 
The consensus "YFP" is replaced with "YYP" in 
DGAT2s of human, chimpanzee (Pan troglodytes) and 
Ashbya gossypii, "FFP" in Helianthus annuus and Nema- 
tostella vectensis, and "HFP" in caster bean (Ricinus 
communis), Vernonia galamensisand, and Selaginella 
moellendorffii. It was reported that a unique region is 
present in DGAT2 of Saccharomyces cerevisiae [33], but 
in our expanded analysis, a similar region is also found 
in Ashbya gossypii (data not shown). 

Mutations at F80/L81/L83 in mouse DGAT2 [34] and 
F71/L73 in baker's yeast DGAT2 [33] result in partial 
loss of the activity (Table 4). This region is only rela- 
tively conserved in some animal DGAT2s (Figure 10B). 
Finally, ScDGAT2 has a unique cysteine residue (C314) 
which is not involved in catalysis but may be located 
near the active site or related to proper folding of the 
protein [37]. However, this residue is only found in 
DGAT2s from baker's yeast and the other fungi Ashbya 
gossypii and Physcomitrella patens, but is not present in 
the same position of the alignment in any of the other 
51 DGAT2s or any of the 55 DGATls analyzed (Figure 
10C). 

Sequence analysis of important amino acid residues of 
DGATs shown in natural mutants 

The importance of some relatively conserved residues in 
TAG biosynthesis has been demonstrated by two well- 
known natural mutants in corn and cattle. A phenylala- 
nine insertion (F469) in DGAT1-2 increases oil and 
oleic-acid contents in maize. Ectopic expression of the 
high-oil DGAT1-2 allele increases oil and oleic-acid 
contents by up to 41% and 107%, respectively (Table 4) 



Cao BMC Research Notes 201 1, 4:249 Page 8 of 23 

http://www.biomedcentral.eom/1756-0500/4/249 



B 



AaDGATl-XP_0016582^ 
DmDGAT 1-NP_60981 3 
DmDGAT 1-NP_995724 
TcDGATl-XP_975: 
BbDGAT 1-AAZ22403 
BCDGAT1-NP. 



T I -f 



1 11 c 



AD45536 
AAD40881 

AY40785 
AAM0334C 

AV31083 
•A BB843B3 
DQ35668I 



ChDGAT 1-ABD59375 
SsDGATl-NP_999216 
CfDGAT 1 -HP 
CfDGAT 1-XP_85806 
Hi)GATl-NP_03621 
MmDGATl-XP_001090 
PtDGAT 1-XP_S2001 
MmDGATl-NP_034 
RnDGATl-NP_44S889 
OeDGAT l-XPJ) 0272442 7 
M dD GAT 1-XP_001371S65 
DrDGAT 1 -NP_00 100245S 
DrDGATl-NP_956024 
SkDGAT 1-XPJJ02736 160 
AtDGATl-NP_179S35 
BjDGAT 1-AAY407B4 
BnDGAT 
BnDGAT 2. 
BjDGAT 2 
TmDGAT 1 
EaDGAT I 
JcDGATl 
VfDGATl 
RcDGATl-XP_002514132 
PtDGAT 1-XP_002308278 
PtDGAT 1-XP_002330S10 
\l vD GAT 1 -X P _□ □ 227934 5 
GmDGAT 1-AAS78662 
Gm DGAT 1 -BAE9346 1 
LjDGAT 1-AAW51456 
MtDGAT 1-ABN09107 
EpDGATl-ACOSS635 
OeDGAT 1-AAS01606 
PFDGAT 1-AAG23696 
NtDGAT 1-AAF19345 
SmDGAT 1 -XPJ02964 165 
UgDGAT 1-ABV21945 
SbDGAT 1-XPJJ02437 165 

ZmDGAT l-2-EU039S30[ 
PpDGAT l-XPJO 1770929 
OsDGAT 1 -NP_00 1054369 
SbDGAT 1-XP_00243941 9 
CvDGAT 1-EFNS0697 
PtDGAT 1-XPJ021 77753 
DdDGAT 1-XP_645633 
PpDGAT l-EFSSEQij-', 
AcDGAT 2-EGC4 1804 
PbDGAT 1-EEH1717D 
Ao DG A T 2-E E Q3 1 6B 3 
MaDGA T 2-E FY8677 4 
MaDGAT 2-E FY97444 
TgDGAT 1-AAP94209 
AcDGAT 2-X P_0 0322547 7 
Bt DGA T 2-D A A2 1 85 3 
MmDGAT2-NP_08066C 



■o at r ■ 



.012: 



HsD G AT 2-A A Q8889 6 
PaDGAT2-XP_002822304 
XtDGAT2-NP_989372 
DrDGAT 2-NP_Q0 1025367 
IpDG AT 2-NPJO 1188005 
BtDGAT2-XP_875499 
Hi)GAT2-NP_835470 
PtDGAT2-XP_527842 
OaD GAT 2-X P _0 0 IS 1 389 9 
DdDGAT 2-XP_635762 
PpDGAT 2-E FA8364 6 



XP_001630435 
XP_001633322 

.□□1635548 
XP_002633800 

.□□2187643 



NvD'jA T L- 
NvDGAT 2- 
NvD GAT 2 
BtDGAT 2 
TgDGAT 2- 
MdDGi 
aDGAT2-XP_0021 20879 
CeDGAT2-NP 5-05413 
CeDGAT2-NP_872180 
AgDGAT2-NP_9S3542 
ScDGAT2-NP_014888 
SpDGAT2-XP_001713160 
TcDGAT2-XP_975146 
ODGAT2-XPJJ0 1694904 
CrDGAT2-XP_00 1693 189 
CvDGA T 2-E FN5 1 30 6 
OtDGAT2-XP_0030B3539 
PpDGAT 2-XP_00 1758758 
PpDGAT 2-X P_00 1777726 
SmDGAT 2 -XP_0 02 97205 4 
PsDGAT2-ABK262S6 
SPDGAT2-AAQ39S90 
EoDGAT2-AC03S36S 
HvDGAT2-BAJ8573C 
OsD<3AT2-NP_001 057530(1 57; 
OsDGAT2-NP_001 0479 17(1 55 
SbDGAT 2-XP _□ 0245265 2 " 
ZmDGAT 2-PJP_00 1 1 50 174 

AtDGAT2-NP_566952(132; 
BnDGAT 2 a-ACO 90 1 87( 1 34 ' 
BnD GAT 2b-ACO 90 1 88( 1 34 
V vDGAT2-XP_002263626( 154 
EaDGAT 2-ADF57328.lt 148; 
Gm DGAT 2- ACU2034 4 . 1 ( 1 34 
MtD GAT 2-AC3S4 86 7 . 1 [ 1 38 j 
HaDGAT2-ABU5032B.l(153 
OeDGAT 2-ADG22608.1( 152; 
PcDGAT2-XP_002317635.1 (82; 
RcDGAT2-XP_002S2853 1.1(1 57; 
VgDGAT2-ACV40232.1(153; 
VfDGAT2-DG35SS32( 
UiCGAT2A-AAK84179.1[155; 
UrDGAT2B-AAK84180.1(148; 

Consensus! 4 76; 



£151 



rLTXi L ; 



3 FF.3 LVW'ViP 



; T.TLA FI^'AP 



W2 ff,3 la 

in; -ja: -r. " ■■. 

-.SI'QSLA Tl'.VXf 



qp;i pp a r 



QPR PP P3 



QPT PSTTC 



FV FFTZ'P: 



FV IF-.TZ.r.. 



■Tjcroi 

P Y lAT L AGtlfeHP I LB D yBh 3 G I 
FV ATLAGNTPHI LP YLH3GC- 
P Y LAT L A GIITEH p " " LB i 
FY ATLAGWEFHF'LR! 
FY ATLAGirTRHF Li Y1H3G - 
FY ATLAGNFPHF " LP Y HSf-t: 

en atlaghtriie 'lp.l:-,ib3cg: 
IiAtlacwtpl; ? v jicgl-: 

AT AGTJ PL L.P. Y M3C.C-: 



F ARDTIDYIL 
P?7IRH3IDYLL 



CTLVC-Q 



H5IAS3VC 3AF 



3AV Y F..P 



3AV VI F..P 



■ 3AK3LH '■!- 



'3 X.3 TAFL 



3 X.H YAY] 

AT RHFQ3YIi 
KFY3 Yft 
3FY3YX 



r-FTPVTr.iin : 



.trt.h : 



(610' 

AaDGATl -XP_00 1 658299(338; 
DmDGATl-NP_609813.1(419; 
DmDGAT 1 -NP_995724, 
TcDGAT 1-«P_975142 
BbDGATl-AAZ22403 
BtDGATl-NP_77711B 
OaDGAT 1 -M P _□ 0 1 1 03634 
OlD<3ATl-ABD59375 
SsDGATl-NP_999216 
CfDGAT 1-XP_849176 
CfDGAT 1-XP_858062 
HsDGATl-NP_036211 
MmDGATl-XP_00 10901 34 
PtDGATl-XP_520014 
MmDGATl-NP_034176 
RnDGAT 1-NP_445889 
OcD GAT 1 -X P _□ □ 27244 2 7 
MdDGAT 1-XP_001371565 
DrDGAT 1-NP_00 100245B 



SkDGAT 1-XF 
AcDGAT 
BjDGAT 



■NP_179535 
A AY 40784 



AY40785 
■AAH0 3::4iJ 

AU31083 
ABB843B3 
DQ3566B0 



Br,C-iAl 
BjD GAT 
TmDGAT 
EaDGAT 
JcDGAT 
UfDGAT 
BcDGATl-XP_0025 
PtDGAT 1-XP_002308278 
PtDGAT 1-XP_002330510 
VvDGAT 1-XP_002279345 
GmDGATl-AAS78662 
Gm DGAT 1 -BAE9345 1 
LjDGAT 1-AAW51456 
MIDGAT1-ABN09107 
EpDGATl-AC055635 
OeDGAT 1-AAS01606 
PFD GA T 1 -A AG236 9 6 
NtDGAT 1 -A AF1 9345 
SmDGAT 1-XPJ029641 65 
UgDGAT 1-ABU21945 
SbDGAT 1-XP_0024371 65 



NvD'iAT I- 
NuDGAT 2- 
MvDGAT2- 

BtDGAT 2 
TgDGAT 2- 
MdDi 

QDGAT2- 



mDGATl-26U039330[3S3; 

(308; 
2i J5 
(375; 
(284 
.(276; 
2(466; 
1(453; 
1(367: 
1(360; 
1(361 
1(332; 
1(361 
1(390; 
1(266' 
1(246; 
1(273; 
1(25 

1(273; 

1(273; 
1(246] 
1(246; 
1(246] 
3(291 
(226; 
z[ :■: f 

1(132; 
i(2ie; 

1(216' 
1(223; 

1(199; 

1(189' 

1(22 1; 

1(221 
1(221 
1(235; 
1(225; 
1(346; 
1(303] 
1(230; 
1(226] 
1(221 

1(216; 
i(i3s; 



PpDGAT 1-XPJ01770929 
OsDGAT 1-NP_001054869 
SbDGAT 1-XP_00243941 9 
CvDGATl-EFN50697 
PtDGAT 1-XPJ02177753 
DdDGAT 1-XP_645633 
PpDGAT 1 -E FA 3 EGG -I 
AcD GA T2-EGC4 1 B □ 4 
PbDGATl-EEH17170 
Ao DG A T 2-E EQ3 1 6 B 3 
MaDG A T 2 -E FY 867 7 4 
MaDG A T 2 -E FY 9 74 4 4 
TgDGAT 1-AAP94209 
AcDGAT 2-XPJ03225477 
Bt D GAT 2-D AA2 1853 
MmDGAT2-NP_080660 
RnDGAT 2-NP_00 10 12345 
HsDGAT2-AAQ88896 
PaD GAT 2-X P _00 28223 □ 4 
XtDGAT2-NP_989372 
DrDGAT 2-MP_00 1025367 
IpDGA T 2-N P _0 0 U 880 0 5 
BtDGAT2-XP_875499 
HsDGAT2-NP_S3E47G 
PCDGAT2-KP_527B42 
OaDGAT 2-XPJ0 15 1 8899 
DdDGAT 2-XP_635762 
PpDGAT2-EFA83646 



XP_001630435 
XP_001633322 
.001635548 
XP_0 026838 00 
□02187643 



.□□2120B79 
CeDGAT2-NP_505413 
CeDGAT2-NP_8721B0 
AgDGi 



ScDGAT2-NP_014B88 
SpDGAT2-XP_001713160 
TcDGAT2-XP_975146 
ODGAT2-XP_00 1694904 
G-DGAT2-XP_00 16931 89 
CvDGAT2-EFN513Q6 
OtDGAT2-XP_003083539 
PpDGAT 2-X P_0 01 7587 58 
PpDGAT 2-X P_00 1777726 
SmDGAT 2-X P_0 029720 5 4 
PsDGAT2-ABK26256 
SpDGAT2-AAQ89590 
EoDGAT2-AC035365 
HvDGAT2-BA385730, 
OsDG A T2 -NP_00 1 05 7530[ 23 7\ 
OsDGAT2-NP_00 104791 7(235 
SbDGAT 2-X P_0 024526 5 2. 1(23 1; 
ZmDGAT 2-MP_00 1150174.1(230; 
AtDGAT2-NP_5&6952(212; 
B n DG A T 2 a-ACO90 187( 2 1 4 
BnDGAT2b-ACO90 188(21 4 
V vDGAT 2 -XP _00 2 26 3626( 234 
EaD GA T2 -A DF573 2 8 , 1 [ 228; 
GmDGAT2-ACU203' 
MtDGAT2-AC384867.1(213; 
HaDGAT2-ABU5032B. 1(233" 
OeDGAT 2-A DG226 08. 1(232] 
PtDGAT 2-XP_0023 17535.1(162; 
RcD GAT 2-X P_0 025285 3 1.1(237] 
VgDGAT 2- ACU 40232.1(235; 
U fD G AT 2 -DG 3S6632( 2 1 
UiOGAT2A-AAK841 79. 1(238; 
UrDQAT2B-AAK84180.1(230; 

Consensu s(610; 




r-.:-EF-.EF'.T. 



F GE P.E FTK ITOI 



HFT-DP.EFY-J-E' 



jLRFGDB3FYT» 
.RFGDR.-FYlB 
rrPFGDR JFYLD' 
■PIY3 F 



L"PAYTFGnTFAYKVHJR 
Jir PTFCFGgTNCYFXJHR 
1," PT FC F pg RI1ATTRKWK 



- TF.AYRWW. 

FGgi:EA|- J uron 

FC 3 3"-' ' T- f'T ' T. 



FC ■■ 3YA . ..T,I P 

jgsYVYWirraR 

FAFGgsYVTKraStE 



rFGgAHTyKtffiirR 



AaDGAT 1 -XP_00 1 6 582 99( 
DmDGAT 1 -NP_6098 13. 
DrnDGATl-NP_99572. 
TcDGAT 1-XP_975142. 
BbDGAT 1-AAZ22403, 
BtDGATl-NP_777118, 
OaDGATl-MP_00110363' 
ChDGAT 1-ABD59375. 
Si>GATl-NP_999216, 
CfDGAT 1-XP_3491 76, 
CfDGAT 1-XP_35306; 
HsDGATl-NP_03621 
MmDGATl-XP_00109013' 
PtDGAT 1-XP_52001' 
MmDGATl-NP_034176, 
RnDGAT 1-NP 445889, 
OcDGAT 1-XP_002724427, 
MdDGAT 1-XP_001371565, 
DrDGAT 1-NP_00 100245B, 

SkDGAT 1-XP_00 27361 61 
AtDGATl-NP_179535, 
BjDGAT 1-AAY4078' 



AAD4088: 

.AY40785. 
AAM03340, 
31083. 
ABB84383. 
DQ3566BI 



BnD'iAl L- 
BjDGAT 2- 
TmDGAT 1- 
EaDGAT 1 
JcDGAT 1 
UfDGAT 1 
RcDGATl-XP_00251 
PtDGAT 1-XP_0 0230327: 
PtDGAT 1-XP_00233051I 
UuDGAT 1-XP_002279345, 
GmDGAT 1-AAS78662. 
Gm DGAT 1-BAE9346 
LjDGAT 1-AAW51456. 
MtDGATl 
EpDGATl-AC055635, 
OeDGAT 1-AAS01606, 
PFDGA T 1 -A AG236 9 6 , 
NtDGAT 1 -A AF1 9345, 
SmDGAT 1-XPJ029641 65. 

UgDGAT 1-ABW21945, 
SbDGAT 1-XP_0024371 65, 



ZmD' 



PpDGAT 1-XPJ01770929. 
OsDGAT 1-MP_001054B69, 
SbDGAT 1-XP_0024394 19, 
CvDGATl-EFM50697, 
PtDGAT 1-XPJ021 77753, 
DdDGAT 1-XP_645633, 
PpDGAT 1-EFA350 1 
AcDGAT 2-E GC4 181 
PbDGAT 1-EEH 1 71 1 
AoDGAT 2-E EQ3 1683, 
MaDGAT 2-E FY 3677' 
MaDGAT 2-E FY 9744' 
TgDGAT 1-AAP9420' 
AcDGAT 2-XPJ03225477, 
BtDGAT 2-D AA218E3, 
MmDGAT2-NP_08066l 
RnDGAT 2-MP_00 10 1234! 

HsDGAT2-AAQ88896, 
PaDGAT2-XP_00282230' 

XtDGAT2-NP_989372, 
DrDGAT 2-NP_00 1025367, 
IpDGAT2-NP_00 11 88005, 
BtDGAT 2-XP_875499, 
HsDGAT2-NP_33547l 
PtDGAT 2-XP_527842, 
OaD GAT 2-X P _00 15 1 88 9 9 , 
DdDGAT 2-XP_635762, 
PpDGAT2-EFA33646, 



001630435, 
P_001633322, 
P_00163554B, 



N»C"iATj- 
NuD GA T 2- 
NuD GAT 2- 
BtDGAT2- 
TgDGAT2-XP_0021876' 
MdDGAT2-XP_00 1365685, 
ODGAT2-XP_0021 20879, 
CeDGAT2-NP_505413, 
CeDGAT2-NP_872180, 
AgDGAT2-NP_983542, 
ScDGAT2-NP_014888, 
SpDGAT2-XP_001713160, 
TcDGAT2-XP_975146, 
ODGAT2-XP_00169490' 
ODGAT2-XP_00 16931 89, 
CvDGAT2-EFN51306, 
OtDGAT2-XP_003083539, 
PpDGAT2-XP_00 1753751 
PpDGAT 2-X P 00 1777726, 
SmD GAT 2-X P _□ □ 29720 5 4 , 
PsD GA T2 -A BK262 5 6 , 
SpDGAT2-AAQ89S9l 
EoDGAT 2-AOD35365, 
H uDG AT2-B AJ857 3 1 
OsDG A T 2 -NP_00 1 05 75 30( 
OsDG AT2 -NP_00 1 04 7 9 17( 
SbDGAT 2-XP_002452652. 
ZmDGAT 2-NPJDO 1150174. 

AtDGAT2-NP_566952( 
B PI DG A T 2 a-ACO 90 1 37( 
BnDG AT2b-ACO90 1 8 
U vDGAT 2-XP_0022635i 
EaD GA T2 -A DF5 73 2 8 , 
GmDGAT 2-A OJ20344, 
MtDGAT2-ACJ84867, 
HaDGAT2-ABU50328, 
OeDGAT 2-ADG22608, 
PtDGAT 2-XP_0023 17635, 
RcDGAT2-XP_002523S31, 
UgDGAT 2-ACV 40232. 
V/fDGAT2-DG3S6632( 
UrDGAT2A-AAK84179. 
UrDG AT 2B-AAK84 180, 



Y V.-IPLP.HFl 

Y . J IF LHHFI 





PFEKPVTT 
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•ATPLHT* 
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Figure 2 Conservation of proline and phenylalanine residues in all DGATs. Multiple sequence alignment was performed using the ClustalW 
algorithm and 1 17 DGAT protein sequences from 70 organisms (listed in Table 1). DGAT sequence name is on the left of alignment followed by 
the start of the amino acid residue of each DGAT protein sequence. The completely conserved proline and phenylalanine residues are 
highlighted in red on yellow. Other color code and related information are described in "Methods" section. (A) Motif 1 (P Block): the completely 
conserved proline residue in all DGATs, (B) Motif 2 (FG Block): the completely conserved tryptophan residue followed by a highly conserved 
glycine residue in all DGATs, (C) Motif 3 (P-1 Block): the almost completely conserved proline residue except one in all DGATs. 
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Table 3 The completely conserved residues in DGATs using examples of mouse, tung tree and Dictyostelium 
discoideum 



Organism 



DGAT1 



DGAT2 



Mouse 
(animal) 



1 MGDRGGAGSS RRRRTGSRVS VQGGSGPKVE 
EDEVRDAAVS PDLGAGGDAP 

51 APAPAPAHTR DKDGRTSVGD GYWDLRCHRL 
QDSLFSSDSG FSNYRGILNW 

101 CWMLILSNA RLFLENLIKY GILVDPIQW 
SLFLKDPYSW PAPCVIIASN 

151 IFWAAFQIE KRLAVGALTE QMGLLLHWN 
LATIICFPAA VALLVESITP 

2 01 VGSVFALASY SIMFLKLYSY RDVNLWCRQR 
RVKAKAVSTG KKVSGAAAQQ 

251 AVSYPDNLTY RDLYYFIFAP TLCYELNFPR 
SPRIRKRFLL RRVLEMLFFT 

301 QLQVGLIQQW MVPTIQNSMK PFKDMDYSRI 
IERLLKLAVP NHLIWLIFFY 

351 WFFHSCLNAV AELLQFGDRE FYRD WWN AES 
VTYFWQNWNI PVHKWCIRHF 

401 YKPMLRHGSS KWVARTGVFL TSAFFHEYLV 
SVPLRMFRLW AFTAMMAQVP 

451 LAWIVGRFFQ GNYGNAAVWV TLII GQP VAV 
LMYVHDYYVL NYDAPVGV 



1) G96, 
P270, 



2) L105, 3) K216, 4) S219, 5) R228, 6) 



1 MKTLIAAYSG VLRGERRAEA ARSENKNKGS 
ALSREGSGRW GTGSSILSAL 
51 QDIFSVTWLN RSKVEKQLQV ISVLQWVLSF 
LVLGVACSVI LMYTFCTDCW 

101 LIAVLYFTWL AFDWNTPKKG GRRSQWVRNW 
AVWRYFRDYF PIQLVKTHNL 

151 LTTRNYIFGY HPHGIMGLGA FCNFSTEATE 
VSKKFPGIRP YLATLAGNFR 

201 MPVLREYLMS GGICPVNRDT IDYLLSKNGS 
GNAIIIWGG AAESLSSMPG 

251 KNAVTLKNRK GFVKLALRHG ADLVPTYSFG 
ENEVYKQVIF EEGSWGRWVQ 

301 KKFQKYIGFA PCIFHGRGLF SSDTWGLVPY 
SKPITTWGE PITVPKLEHP 

351 TQKDIDLYHA MYMEALVKLF DNHKTKFGLP 
ETEVLEVN 

1) P162, 2) H163, 3) P202 , 4) R205, 5) G239, 

6) G240, 

7) E243, 8) R259, 9) G261, 10) F262, 11) A2S6, 

12) V274, 

13) P275, 14) F279, 15) G2 8 0 , 16) E339 



7) T271, 

12) W345, 

13) L346, 



R285, 9) Q308, 10) P313, 11) L337, 



14) F34E 



F366 , 18) D368, 

19) R369, 20) F371, 

W376, 24) N377, 

25) W3 85, 26) W3 88, 



15) F353, 
21) Y372, 



16) E362, 
22) W375, 



P403, 30) 
31) S422, 



F419, 
32) H426, 



Q448, 36) L451, 
37) N465, 38) G475, 
Y483 



27) N389, 28) P391, 
33) P433, 34) F442, 
39) Q476, 40) P477, 



17) 
23) 
29) 
35) 
41) 



Tung tree 
(plant) 



1 MTILETPDNS TDATTSGGAE SSSDLNLSLR 
RRRTASNSDG AVAELASKID 

51 ELESDAGGGQ VIKDPGAEMD SGTLKSNGKD 
CGTVKDRIEN RENRGGSDVK 

101 FTYRPSVPAH RALKESPLSS DNIFKQSHAG 
LFNLCIWLV AVNSRLIIEN 

151 IMKYGWLIKT GFWFSSRSLR DWPLLMCCLT 
LPIFSLAAYL VEKLACRKYI 

2 01 SAPTWFLHI LFSSTAVLYP VSVILSCESA 
VLSGVALMLF ACIVWLKLVS 

251 YAHTNFDMRA IANSVDKGDA LSNASSAESS 
HDVSFKSLVY FMVAPTLCYQ 

301 PSYPRTASIR KGWWRQFVK LIIFTGFMGF 
IIEQYINPIV QNSQHPLKGD 

351 LLYAI ERVLK LSVPNLYVWL CMFYCFFHLW 
LNILAELLRF GDREFYKDWW 

401 NARTVEEYWR MWNMPVHKWM VRHIYFPCLR 
HKI PRGVALL ITFFVSAVFH 

451 ELCIAVPCHI FKLWAFIGIM FQIPLVGITN 

YLQNKFRSSM VGNMIFWFIF 

501 CILGQPMCLL LYYHDLMNRK GTTESR 

1) G130, 2) L139, 3) K247, 4) S250, 5) R259, 

6) P295 , 

R310, 9) Q334, 10) P338, 11) L361, 



1 MGMVEVKNEE EVTIFKSGEI YPTNIFQSVL 
ALAIWLGSFH FILFLVSSSI 
51 FLPFSKFLLV IGLLLFFMVI PINDRSKLGQ 
CLFSYISRHV CSYFPITLHV 

101 EDINAFRSDR AYVFGYEPHS VFPIGVMILS 
LGLIPLPNIK FLASSAVFYT 

151 PFLRHIWSWC GLTPATRKNF VSLLSSGYSC 
ILVPGGVQET FYMKQDSEIA 

201 FLKARRGFIR IAMQTGTPLV PVFCFGQMHT 
FKWWKPDGEL FMKIARAIKF 

251 TPTIFWGVLG TPLPFKNPMH WVGRPIEVK 

QNPQPTAEEV AEVQREFIAS 

301 LKNLFERHKA RVGYSDLKLE IF 

1) P118, 2) H119, 3) P15 1, 4) R154, 5) G185, 

6) G186, 

7) E189, 8) R205, 9) G207, 10) F208, 11) A212, 

12) V220, 

13) P221, 14) F225, 15) G226, 16) E274 



7) T296, 

12) W369, 

13) L370, 



14) F373, 15) F377, 16) E386, 
D392 , 

19) R393, 20) F395, 21) Y 



F390, 18) D392, 



W400, 24) N401, 
25) W4 09, 26) W412 



5, 22) W399, 
27) N413, 28) P415, 



17) 

23) 
29) 



P427, 30) 
31) S446, 
Q472, 36) 
37) N4 93, 
Y512 



F443 , 
32) H450, 
L475 , 
38) G504, 



33) P457, 34) F466, 35) 



39) Q505, 40) P506, 41) 
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Table 3 The completely conserved residues in DGATs using examples of mouse, tung tree and Dictyostelium discoi- 
deum (Continued) 



Dictyostelium 


1 MEPIPPSNGN KNNSMDKQPQ QPQQPQQQQQ 




1 MVRFVPWNVP LYRRLETMAV AIYAMVLPVC 


discoideum 


QQQQQRRDQR NSKLNELNET 




LIMAFNLIVI PLFWGIAIPY 


(fungus) 


51 ERVRNRFISH EFHKLDRTKS RIDAPKISFS 




51 LVWMFYFDTK HESGGRRVSL VRNSILWRYF 




DSESESDSEF FLAKRNTNNN 




RDYFPISLII NSNYDPKKNY 




101 NQNNTSPTFS SANGKQSNLT QRKINTQIQS 




101 IFAYHPHGII SIGAFCNFAT NANNIDEKLP 




KQPTNNNVQP LTDDEGTINH 




GLKVHLLTLE SNFKIPFLRD 




151 SNHHHHHHNQ NNNGNNNNNN NNNNNNNKIS 




151 VLMSFGMSSV SKKSCENILN SGAGESICLV 




TPPKQEEKMT MNGLFTLRPS 




VGGAEESLDA RPGLNEITLK 




2 01 ILSSESNGSS YRGFLNLLLI LLITASFRLV 




2 01 KRKGFIKLAL VNGASLVPVY SFGENDIYDQ 




ILNHLLYGIR INLDLYKISE 




VPNPRGSLVR KIQTKIKDLT 




251 YHRWPGVMIS LMINLFIIAA YLIEKAAAKQ 




251 GIAPPLFMGR GIFNYDFGLL PVRHKIVTW 




LiLPDRlCYLL R11JNLAAV11 




GhjPlJJlPK-lK SPlJJyVlEHY 




301 VPSGSIIAFS PNPASGIIVM ILICTFSMKI 




301 HQIYVEALQN LFDKHKNSCA DKETGNLKIN 




ISYAYENSKQ RKLNPDNKKF 




1) P106, 2) H107, 3) P146, 4) R149, 5) G182, 




351 VIDPTNTSIY PNNLSLRSTY WFMLVPTLVY 




6 ) G183 , 




QLSYPRSPKI RKGYLLRRIV 




7) E186, 8) R202, 9) G204, 10) F205, 11) A209, 




401 EALSLSLLIL WMVNQYMLPL VQNSIEPLEK 




12) V217, 




IDIVLIVERI MKLSLPNLYV 




13) P218, 14) F222, 15) G2 23, 16) E281 




451 WLLGFYVFFH LYLNIVAEIT RFGDREFYRD 








WWNSTGLDYF WRTWNMPVHH 








501 WMWLIYTPM RRRGFSKNMG YFMCFFVSAI 








FHELVISIPF HSLKLWGFFG 








551 IMSQMVLIAL TKNLMNGRNL GNVIFWISIV 








LGQPLWLLY YRNFVLENPE 








601 WYRNVEPPTS PPVMPFY 








1) G213, 2) L222, 3) K329, 4) S332, 5) R341, 






6) P376, 








7) T377, 8) R391, 9) Q415, 10) P419, 11) 


L443, 






12) W451, 








13) L452, 14) F455, 15) F459, 16) E468, 


17) 






F472, 18) D474, 








19) R3475, 20) F477, 21) Y3478, 22) W48 


1, 23) 






W482 , 








24) N483, 25) W491, 26) W494, 27) N495, 


28) 






P497, 29) P509, 








30) F525, 31) S528, 32) H532, 33) P539, 


34) 






F548, 35) Q554, 








36) L557, 37) N572, 38) G582, 39) Q583 , 


40) 






P584, 41) Y590 







The completely conserved residues in DGATIs and DGAT2s are underlined and listed below the sequences. The underlined P and F residues are conserved in all 
DGATs. 



[38]. This phenylalanine residue is conserved in all 
plants except Brassica napus [GenBank:AAD45536.1] 
and conserved in all fungi except mold [Dictyostelium 
discoideum and Polysphondylium pallidum) (Figure 
11 A). Rape DGAT1 is the only sequence in the plant 
group with a serine residue in the place of phenylalanine 
in the sequence alignment (Figure 11A). Since this gene 
is isolated from suspension cultures of Brassica napus 
[39] and the native form is not available, it is not known 
if this replacement was caused by mutations due to cell 
culture conditions, considering that cell culture could 
cause significant changes in gene expression [40]. It is 
interesting to note that a similar phenylalanine residue 
is not present in any of the animal DGATIs or any of 
DGAT2s. 

In cattle, a nonconservative substitution of lysine by 
alanine (K232A) in DGAT1 (Table 4) is directly respon- 
sible for the quantitative trait loci (QTL) variation with 
the lysine-encoding allele being associated with higher 



milk fat content [41]. Figure 11B shows that the lysine 
residue is conserved in mammalian DGATIs but not in 
plants, or fungi, or other animals (fly, frog, insect and 
worm) except one of the two forms from dog and zebra- 
fish. The wild-type cattle DGAT1 shows the normal 
lysine at 232 position [GenBank:AAL49962.1] (Table 4). 
A similar lysine residue is not found in any of DGAT2s. 

Discussion 
DGAT classification 

The nomenclature of proteins derived from DNA 
sequences in GenBank databases can lead to confusion 
in some cases. One of the utilities of this extensive 
sequence analysis is to use the completely conserved 
amino acid residues in respective sequence blocks of 
DGAT1 and DGAT2 subfamilies as signatures of DGAT 
proteins for classification. It is generally accepted that 
DGATs are divided into DGAT1 and DGAT2 subfami- 
lies. However, more than two forms of DGATs are 
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(288) ± 

DmDGATl-NP_609813.1{120) F. 
DmDGATl-NP_995724.1{170) e 
TcDGAT1-XP_975142.1 (39) f 
BbDGATl-AAZ22403.1 (66) E 
BtDGATl-NP„777118.2 (66) E 
OaDGATl-NP_001103634.1 (66) E 
SsDGATl-NP_999216.1 (66) E 
CfDGATl-XP_849 176.1 (66) E 
CfDGATl-XP_858062.1 (66) E 
HsDGATl-NP_036211.2 (69) E 
MmDGATl-XP_001090134.1 (72) E 
PtDGATl-XP_520014.2 (69) E 
MmDGATl-NP_034176.1 (78) E 
RnDGATl-NP_445889.1 (78) E 
OcDGATl-XP_002724427.1 (85) E 
MdDGATl-XP_001371565. 1(210) E 
DrDGATl-NP_001002458.1 (77) E 
DrDGATl-NP_956024.1 (69) E 
SkDGATl-XP_002736160.1 (46) E 
DdDGATl-XP_645633.2(195) E 
PpDGATl-EFA85004. 1(183) E 
AcDGAT2-EGC41804.1 (69) £ 
PbDGATl-EEH17170.1 (62) 3 
AoDGAT2-EEQ31683.1 (63) £ 
MaDGAT2-EFY86774.1 (64) 3 
MaDGAT2-EFY97444.1 (64) 3 
TgDGATl-AAP94209. 1(104) E 
CvDGATl-EFN50697.1 (12) I 
PpDGATl-XP_001770929.1 (34) F 
SmDGATl-XP_002964165.1 (9) F 
OsDGATl-NP_001054869.2(125) F 
SbDGATl-XP_002439419. 1(102) F 
SbDGATl-XP_002437165.1 (76) F 
ZmDGATl-2-EU039830 (80) F 
VgDGATl-ABv21945.1{110) F 
EpDGATl-AC055635.1 (60) F 
OeDGATl-AA501606.1(119) F 
PfDGATl-AAG23696. 1(121) F 
NtDGATl-AAF19345. 1(123) F 
AtDGATl-NP_179535. 1(115) F 
BjDGATl -A AY 40784.1 (98) F 
BnDGATl-AAD45536.1 (98) F 
BjDGAT2-AAY40785.1 (98) F 
TmDGATl-AAM03340.2(107) F 
EaDGATl-AAV31083.1 (94) F 
GmDGATl-AA578662.1 (85) F 
GmDGATl-BAE93461.1 (91) F 
LjDGATl-AAW 5 1456.1 (98) F 
MtDGATl-ABN09107.1{125) F 
VvDGATl-XP_002279345. 1(102) F 
jcDGAT1-ABB843S3. 1(106) F 
VfDGATl-DQ356680.1(lll) F 
RcDGATl-XP_002514132.1(lll) F 
PtDGATl-XP_002308278.1 (37) F 
PtDGATl-XP_002330510.1 (78) 3 
Consen5us(288) 




SH GL NLC WL A N RLII] 



(447) 

DmDGATl-NP_609813. 1(259) 
DmDGATl-NP_995724. 1(309) 
TcDGATl-XP_975142. 1(178) 
BbDGATl-AAZ22403. 1(202) 
BtDGATl-NP_777118.2(202) 
OaDGATl-NP_001103634.1(202) 
SsDGATl-NP_999216. 1(202) 
CfDGATl-XP_849176. 1(202) 
CfDGATl-XP_858062. 1(202) 
HsDGATl-NP_036211. 2(205) 
MmDGATl-XP_001090134. 1(208) 
PtDGATl-XP_520014.2(205) 
MmDGATl-NP_034176. 1(214) 
RnDGATl-NP_j445889. 1(214) 
OcDGATl-XP_002724427. 1(221) 
MrJDGATl-XP_001371565. 1(346) 
DrDGATl-NP_001002458. 1(213) 
DrDGATl-NP_956024. 1(205) 
SkDGATl-XP_002736160. 1(184) 
DdDGATl-XP_645633.2(327) 
PpDGATl-EFA85004.1(315) 
AcDGAT2-EGC41804.1(226) 
PbDGATl-EEH17170. 1(219) 
AoDGAT2-EEQ31683. 1(220) 
MaDGAT2-EFY86774. 1(191) 
MaDGAT2-EFY97444. 1(220) 
TgDGATl-AAP94209. 1(239) 
Cv DGAT 1-EFN50697. 1(141) 
PpDGATl-XP_001770929. 1(168) 
SmDGATl-XP_002964165. 1(141) 
OsDGATl-NP_001054869. 2(259) 
SbDGATl-XP_002439419. 1(236) 
SbDGATl-XP_002437165. 1(210) 
ZmDGATl-2-EU039830(214) 
VgDGATl-ABV21945. 1(244) 
EpDGATl-AC055635. 1(194) 
OeDGATl-AAS01606. 1(253) 
PfDGATl-AAG23696. 1(255) 
NtDGATl-AAF19345. 1(257) 
AtDGATl-NP_179535. 1(249) 
BjDGATl-AAY40784.1(232) 
BnDGATl-AAD45536. 1(232) 
BjDGAT2-AAY40785. 1(232) 
TmDGATl-AA MO 3340. 2(241) 
EaDGATl-AAV31083. 1(228) 
GmDGATl-AAS78662. 1(219) 
GmDGAT 1-BAE93461. 1(225) 
LJDGAT1-AAW51456. 1(230) 
MtDGATl-ABN09107. 1(259) 
VvDGATl-XP_002279345. 1(236) 
JCDGAT1-ABB84383. 1(240) 
VfDGATl-DQ356680. 1(245) 
RcDGATl-XP_002514132. 1(245) 
PtDGATl-XP_002308278. 1(171) 
PtDGATl-XP_002330510. 1(212) 
Consensu s(447) 



7 :.:': w yvqtnmi 



7 :::cr yfcvnlwc/E 

ALKL F 3 YRDVN LWC PE 
ALKL F 3 YRDVN LWC PE 
fLKIi FS YRDVN LWC p E 
fLKL F 3 YRD VN LWC' PE 
fLKL F 3 YRDVN LWC RE 
rLKLFSYRDvjsKCR- 
ALKL FS YRDVN LWC R- 
ALKLFSYRDVflSWCR- 
fLKL Y 3 YRDVN LWC' F .Q 
ALK1S3YRDVNLWCP.Q 
ALKL.F3YRDVNLWCP.Q 
ALKf-IVAYiGVNLWCPE 
r LKLY 3 YKDVMKWC RE 
AL?;LY 3 YTDVHKWCRE 
fLKL I SYVLVNPWC - C 
MKI I SYAYeBsKQRK 
|NKQRH 




MI E 

WLKLVSYAHTNADI RA 
WLKLVS FAHTNHDI RQ 
WLKLVS FAHTNHDI RK 
WMKLVSYAHTNYDIRA 
YAHTNYDIRV 
FVHTNYDMRS 
WLKLVSYAHTNYDMRA 
WLKLVS YAHAS HDMRA 
WLKLV S YAHTN Y DLRV 
K'-' : ! 1 V YAH TNH DMRQ 
WLKLVS YAHT S Y IRS 
WLKLV S YAHT S Y IRT 
WLKLV S YAHT S Y IRT 
WLKLV S YAHTN Y IRT 
WLKLV 3YAHTSS 
WLKLV S YAHTN Y 
WLKLV j'YAHTNY 

wlklvsyahtny mra 
wlklv s yahtn y dmre 
wlklvsyahttydmra 
wlklv s fahtn ydmra 
wlklv s yahtn ydmra 
wlklv s yahtn f dmra 
wlk] v yahtnydm1 a 

wlklvs yahtn d r 



IRT 



: :: a 



(509; 

DmDGATl-NP_609813. 1(312; 
DmDGATl-NP_995724. 1(362; 
TcDGATl-XP_975 142. 1(239; 
BbDGATl-AAZ22403. 1(243; 
BtDGATl-NP_777118. 2(243; 
OaDGATl-NP_001103634. 1(243; 
SsDGATl-NP_999216. 1(243; 
CfDGATl-XP_849176. 1(252; 
CfDGATl-XP_8 58062. 1(233; 
HsDGATl-NP_0 362 11. 2(242; 
MmDGATl-XP_001090134. 1(245; 
PtDGATl-XP_520014. 2(242; 
MmDGATl-NP_034176. 1(253; 
RnDGATl-NP_ 445889. 1(253; 
OcDGATl-XP_002724427. 1(257; 
MdDGATl-XP_00 1371565. 1(385; 
DrDGATl-NP_001002458. 1(258; 

DrDGATl-NP_9 56024. 1(251 
SkDGATl-XP_002736160. 1(235; 
DdDGATl-XP_645633.2(359; 
PpDGATl-EFA85004. 1(346; 
AcDGAT2-EGC41804.1(260; 
PbDGATl-EEH17170. 1(253; 
AoDGAT2-EEQ31683. 1(254; 
MaDGAT2-EFY 86774. 1(225; 
MaDGAT2-EFY97444. 1(254; 
TgDGATl-AAP94209. 1(277; 
O/DGAT1-EFN50697. 1(177; 
PpDGATl-XP_00 17 70929. 1(202; 
SmDGATl-XP_002964165. 1(174; 
OsDGATl-NP_001054869. 2(292; 
SbDGATl-XP_002439419. 1(269; 
SbDGATl-XP_002437165. 1(243; 
ZmDGATl-2-EU039830(247; 
VgDGATl-ABV21945. 1(277; 
EpDGATl-AC055635. 1(227; 
OeDGATl-AAS01606. 1(286; 
PfDGATl-AAG23696. 1(288; 
NtDGATl-AAF19345. 1(286; 
AtDGATl-NP_179535. 1(275; 
BjDGATl -AAY40784. 1(258; 
BnDGATl-AAD45536. 1(258; 
B]DGAT2-AAY40785. 1(258; 
TmDGATl-AAM03340. 2(274; 
EaDGATl-AAV31083. 1(262; 
GmDGATl-AAS78662. 1(253; 
GmDGAT 1-BAE93461 . 1(259; 
LJDGAT1-AAW51456. 1(264; 
MtDGATl-ABN09107. 1(293; 
VvDGATl-XP_002279345.1(270; 
JcDGATl-A BB84383. 1(274; 
VfDGATl-DQ356680. 1(279; 
RcDGATl-XP_002514132. 1(278; 
PtDGATl-XP_002308278. 1(205; 
PtDGATl-XP_002330510. 1(246; 

Consensus(509; 



LTYKI 
LNLRE 
LTYRI 
LTYRI 
LTYRI 
LTYRI 
LTYRI 
SI PSLPPAI 
LTYRI 
LTYRD 
LTYRD 
LTYRD 
LTYRD 
LTYRI 
ANLTYQD 
G LSLRDIY FVF 
G LTHRDIY AF 
D LKLKDLY 
NNLSLRSTY 
N LSLSNTYWYML 
III] 
VTH 

r. vtv;;i 

NNITFGN 
NNITFGN 
YSITLRH 
ENIVAAN 

dhltiqnl 

dnislkn i a 
dnlq p ptlgnli"! 
dnlqaptlgsIti 

ESMKDPTFKf 

enmkdptfk: 
dyfydvnfki 
dcsfevkfqa 
ddsygasfq! 
dyaydvsfk; 
dfsydvsfk; 
evsyyvslk: 
eisyyvslk: 
eisyyvslk: 
evsyyvslk: 
scsydvslk; 
-ysfdvnik; 
-ypynvsfk; 
-yi ynvsfk; 
-ysydvsfk; 
eyphnvsfq; 
-ypydvnfk; 

-3SRDVSFK; 

-sshdvsfk: 

- fCHDVSFKT 
-NI YCTY7K/ ; - 
-HSYDAKFK. 

YP N 



T 



TYKD L LcAl r LC YE L K F P R T 3 RVRKRF L L KR 



I 

w 3h 



F.rLLKP 
F.FLLKR 
EFLL R 
EFLL R 
EFLL R 
EFLL R 

Irfll r 

Irfll r 

Irfll r 

Irfll r 

Irfll r 

Irfll r 

Irfll r 

efll r 

efll r 

[GFLL P. 
RJLL P 

|rfli r 
Iyll r 



TD? .WSJ' 
TDK .WSF 



VJKP 
'VFKR 

klsvie:-: 



■ PR I RT G WVLRQ 
'RTSCVRKGWLIRQ 
RT] 



.S DTI RKSWVLRQ 

.spr i rt g wvlrq 
EiIq 

LVRQ 
P 'TRQ 
'TQQ 
LFC, 
r TRH 
'RTSCIRKGWWRQ 
'RTACIRKGWWRQ 
HTPCIRKGWVARQ 



'L7 2 
'L7 2 



L YF SPTLCYQ YPRSP IRKGWV R 



Figure 3 Identification of completely conserved amino acid residues in sequence motifs 1-3 of DGATIs (A) DGAT1 -Motif 1 (GL Block; 
The boxed E residue is mutated in TmDGATI), (B) DGAT1 -Motif 2 (KSR Block), (C) DGAT1 -Motif 3 (PTR Block; The boxed P residues are also 
conserved in DGAT2s). Multiple sequence alignment was performed using 55 full-length DGAT1 protein sequences from 45 organisms (listed in 
Table 1). The completely conserved amino acid residues are highlighted in red on yellow. Other color code and related information are 
described briefly in Figure 2 legend and with details in "Methods" section. 



present in a number of species (Table 1). Phylogenetic 
analysis and multiple sequence alignment classify all 117 
DGAT protein sequences into DGAT1 and DGAT2 
subfamilies (Figures 1, 3, 4, 5, 6, 7). Furthermore, six 
DGATs currently designated as DGAT2s in the data- 
bases are reclassified here as DGATIs due to their close 
alignment with DGATIs and these sequences containing 
all 41 completely conserved amino acid residues found 
in DGATIs. These six reclassified sequences are [Gen- 
Bank:EGC41804.1, GenBank:EEQ31683.1, GenBank: 
AAY40785.1, GenBank:AAD40881.1, GenBank: 
EFY86774.1 and GenBank:EFY97444.1] (Table 1). 

DGATs which diverge from the DGAT1 and DGAT2 
subfamilies have been recently reported from Arabidop- 
sis thaliana [GenBank:AAN31909.1] [16], Burning Bush 
(Euonymus alatus) [GenBank:GU594061.1] [14], peanut 
(Arachis hypogaea), [GenBank:AY875644.1] [15], caster 
bean (Ricinus communis) [GenBank:XP_002519339.1] 
and yeast {Rhodotorula glutinis) [GenBank:DG315417.1]. 
Phylogenetic analysis and multiple sequence alignment 



indicate that these sequences are completely different 
from DGATIs and DGAT2s. In fact, none of the com- 
pletely conserved residues in DGATIs (41 residues) and 
DGAT2s (16 residues) is present in these new DGATs 
in the multiple sequences alignment (data not shown). 
This sequence divergence is in contrast to the general 
belief that the active sites of enzymes should be con- 
served during the evolution because all catalyze the 
same/similar biochemical reaction. Therefore, this 
sequence divergence raises an important question how 
completely different proteins could perform the same 
biochemical reaction. 

DGAT properties and amino acid composition 

This study has analyzed the properties and amino acid 
composition of 109 full-length DGATs from 70 organisms. 
The average DGATIs are 171 amino acid residues longer 
than DGAT2s resulting in approximately 20 kDa differ- 
ence in the molecular mass. Other DGAT properties are 
similar: both are basic proteins under neutral pH with 
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580; 

DmDGATl-NP_609813.1( 
DmDGATl-NP_995724.li 
TcDGATl-XP_97514Z.l( 
BbDGATl-AAZ22403.ll 
BtDGATl-NP_777118.2i 
OaDGATl-NP_001103634.1 
SsDGATl-NP_999215.1 
CfDGATl-XP^849176.1 
CfDGATl-XP_858062.li 
HsDGATl-NP_036211.2( 
MmDGATl -XP_001090134.il 
PtDGATl-XP_520014.2l 
MmDGATl-NP^034176.1l 
RnDGATl-NP_445889-ll 
OcDGATl-XP_002724427.li 
MdDGATl-XP_001371565.1 
DrDGATl-NP_001002458.1 
DrDGATl-NP_9 56024-1 
SkDGATl-XP_00273616Q.li 
DdDGATl-XP_645633.2l 
PpDGATl-EFAS5004.1. 
AcDGAT2-EGC41804.li 
PbDGATl-EEH17170.ll 
AoDGAT2-EEQ31683.ll 
MaDGAT2-EFY86774.ll 
MaDGAT2-EFY97444.ll 
TgDGATl-AAP94209.ll 
CvDGATl-E FN 50697. 
PpDGATl-XPJIOl 770929. ll 
SmDGATl-XP_002964165.il 
OsDGATl-NP_001054869.2( 
SbDGATl-XP_002439419-li 
SbDGATl-XP_002437165-l 
ZmDGATl -2 -EUO 398301 
VgDGATl-ABV21945.li 
EpDGATl-AC055635.ll 
OeDGATl-AAS01606-ll 
PfDGATl-AAG23696 
NtDGATl-AAF19345. 
AtDGATl-NP_179535.ll 
BjDGATl-AAY40784.ll 
BnDGATl-AAD45536-ll 
BjDGAT2-A AY 40785. ll 
TmDGATl-AAM03340.2l 
EaDGATl-AAV31083.ll 
GmDGATl-AAS78662.li 
GmDGATl-BAE93461.1 
LjDGATl-AAW51456.1 
MtDGATl-ABN09107.1 
VvDGATl-XP_002279345.ll 
JcDGATl-ABB84383.li 
VfDGATl-DQ3 56680.1 
RcDGAT 1 -XP_002 5 14 1 3 2 . 1 
PtDGATl-XP_002308278.li 
PtDGATl-XP_002330510.1( 

Consensu^ 5 8 



580 



599 





— VNWMALFQQl 
— VNWMALFQQV 
— IQLILC] FQ( Mil S 
FLTQLQVGLIQQWMVPAIQN 
fJqIqVGlIqQWMVPAiM 

fJqIqvglIqqwmvpaiM 

fliqIqvglIqqwmvptiM 

Flj|QpQVGlJ|QQWMVPT ipj 

fJqIqvglIqqwmvpt iH 

fBqIqVGlIqQWMVPT I^jjf 

fHqIqvglIqqwmvpt iH 
fftqlqvgliqqwmvpt i qn 

FfeQpQVGllfQQWMVPT 
FFTQLQVGLIQQWMVPT I QH 
FFTQLQVGLIQQWMVPT I QN 
FpQ|LVGlJjQQWMVPT I§ 
L L T" QLL VGLTQQWMV P I IRS 
FLMQ|MVGL|_QWMVPT if" 
FLIQIVLGI 
SLSLLILWMVNC 
FLSi 

GLSVFIWLTSAC 
GLSVFIWLTSi 
GLGVFIWII SAC 
CLSAFII 
CLSAFII 

IKIVVDQHlVPVAl 
" _foPTII 
IQYINPTVKN 
G . YMNPIIRN 
■QGFI IE Q Y IN P I V VN 
■QcHpQY I N PIVVN 
VFTGLMGFI IEQYINP I VKH 
VFTGLMGFI IEQY IN PIVKN 
I FTGFMGFI IEQYINP I VKN 
I FTGLMGFI I E Q Y IN P I VKN 
I FTGFMGFI VEQY IN PIVRN 
I FTGLMG FI I E QYTNP I VQN 
I FTGLMGFI IEQY IN PIVQN 
I FTGFMGFI IEQYINP I VRH 
I FTGLMGFI IEQY IN PIVRN 
I FTGLMGFI IEQY I N PIVRN 
I FTGLMGFI IEQY I.N PIVRN 
VFI GLMGFI IEQY IN PIVRN 
I FTGLMGFI IEQYINP I VQN 
I FTGVMGFI IEQY I N P I VQN 
I FTGVMGFI IEQYINPIVQN 
I FTGVMGFI IEQY IN P I VQN 
I FTGVMGFI IEQYMN P I VQN 
I FTGVMGFI IEQYINP I VQN 
I FTGFMGFI IEQY IN P I VQN 
I FTGFMGFI IEQYINP I VQN 
I FTGFMGFI IEQY I N P I VQN 
I FTGFMGFI IEQYINP I VKtl 
I FTGFMGFI IEQY IN P I VQN 
FTGL FIIEQYI P VQN 



DmDGATl-NP_609813. 
DmDGATl-NP_995724. 
TcDGATl-XP_975142.li 
BbDGATl-AAZ22403.li 
BtDGATl-NP_777118. 
OaDGATl-NP_00 1103634 
SsDGATl-NP_999216 
CfDGATl-XP_849176 
CfDGATl-XP_858062 
HsDGATl-NP_036211.2i 
MmDGATl-XP_001090134.li 
PtDGATl-XP_5 200 14.2i 
MmDGATl-NP_034176 
RnDGATl-NP_445889 
OcDGATl-XP_002724427 
MdDGATl-XP_001371565 
DrDGATl-NP_001002458.li 
DrDGATl-NP_956024.li 
SkDGATl-XP_002736160.li 
DdDGATl-XP_645633.2i 
PpDGATl-EFA85004.li 
AcDGAT2-EGC41804.li 
PbDGATl-EEH17170 
AoDGAT2-EEQ31683 
MaDGAT2-EFY86774.li 
MaDGAT2-EFY97444.li 
TgDGATl-AAP94209 
CvDGATl-EFN50697 
PpDGAT 1-XP_00 1770929 
SmDGATl-XP_002964165 
OsDGATl-NP_00 1054869.2 
SbDGATl-XP_002439419 
SbDGATl-XP_002437165 
ZmDGATl-2-EU039830( 
VgDGATl-ABV21945 
EpDGATl-AC055635 
OeDGATl-AAS01606 
PfDGATl-AAG23696 
NtDGATl-AAF19345 
AtDGATl-NP_179535 
B]DGAT1-AAY40784 
BnDGATl-AAD45536 
B]DGAT2-AAY40785 
TmDGATl-AAM03340.2i 
EaDGATl-AAV31083 
GmDGATl-AAS78662.li 
GmDGATl-BAE93461.li 
L]DGAT1-AAW51456. 
MtDGATl-ABN09107. 
VvDGATl-XP_002279345. 
JcDGATl-ABB84383. 
VfDGATl-DQ356680. 
RcDGATl-XP_002514132. 
PtDGATl-XP_002308278. 
PtDGATl-XP_002330510.li 



625] i 
396) I 



.ALPMli_ J Cfti.:.;i , . YL:l.. :I;"-L_[;;AVG. ^_,/. .! ■■■ l-J. ' . -;t . ALL _ j'l'l^RTWNMPVHPJSCVRHL Y 1 L 'VVQMG Y5SRQASTI\ F 



446] LALPMULCftLCi'l'iLMi 1 i!SLLNAVG_LL:\ \ .L'i-.N. 



ILFNSI 
WLF SCP 



LLGEILt: F'ADRNi'Ybfct 
AVAELMQ r ( IDREI Y ?, II 
AV .. LM< 
Av|] 

iLMQt'tORE . s -I 'V 



WNANNIDTFWRT'.CJM. 

WNANN I dtfwrtwnl: 
zJNSESITYFWLNWNI 

IQlFltDRElf'Y'b.fct WNSESITYFWQNWNII 
ELMC IDREi 'Y • )V WNSESITYFWQNWNIPV1 
IQpf DRElf'Yp.pt WNSESVTYFWQNWNIPVHI 
WNSESVTYFWQNWNIPVHI 
■'tDREtYfefcl WNSESVTYFWQNWNIPV1 
AV ELMQ|F( IDREI 'Y 1 ft WNSEsItYFWQNWNII 

Ityfwqnwnii 

lCjF^DRE^ Y^)I>jWN SE SVT YFWQNWN I P' 

WNAESVTYFWQNWNIPVH] 
WNAESVT YFWQNWN I PVH1 
WNSESVTYFWQNWNIPVHI 
WNSESVT Y FWQNWN I PVH] 
WNSETITYFWQNWNIPVH1 
WNSET I P YFWSNWN I PVH] 
PVH1 

wnstgldyfwrtwnmpvh: 



LAIPNIILAWLCMF 
LAVPNHLIWLIFF 
LAVPNHLIWLIFF WLF SC 
LAVPNHLIWLIFF WLF SC 
LAVPNHLIWLIFF WLF SC 
LAVPNHLIWLIFF 
LAVPNHLIWLIFF 
LAVPNHLIWLIFF WLF SC: 
LAVPNHLIWLI 
LAVPNHLIWLIFF WLF SC 
LAVPNHLIWLIFF 
LAVPNHLIWLIFF WLF SC 
LAVPNHLIWLIFF WLF SC 
LAVPNHFIWLIFF 
LAVPNHLLWLIFF SFFYSSM 
LAVPNHFIWLIFF WY1 

LAIPNHFIWLIWFYWFFHSCLNVLAEVLH 'tDREi i -'. w WNSETLG YFWQNWN I 



WFF SC A\ 

WFF S CMN A V AE LMQ|F 

WLI 

FF WLF SC AV E LMot'it DRE it y|r |) id WN S E S ' 
WLF SCLNAV ELMC r ( IDREI'Y I IV 
WFF SC AV ELLQ ?< IDREi 'Y 5; It 



AV LXLC :'' IL'KE: "i \ IV 
AVGELMR r f IDRQI 'Y 1 ft 
.ELMC ri IDREI Y * II 
,LLR 7 f IDREI 'Y 5. ft 
FHSSMNFVAEIMC r < IDREI 'Y * II 



A_A. . 

fmae: 



LSLPNLYVWLLGFYVFFHLYLNIVAEITB J'< IDREI 'Y S ill 

LSLPNLYVWLLGFYVFFHLYLNICAELTB n IDREI 'i - : . mJSTGLDYFWS'TWNMPVHi 




It WNSPSVGTY 

:v nnspsIgtI 

:v ril^SPSVGSY 
EVLB|F , tDRS*Y|Dfcl WNSESLGAY 
,VLR r < IDRSI Y 3 ft WNSESLGA 



;vmf: r i idrei 'Y r: 

EVMR J'( IDREI Y r: 
,VMR r f IDREI Y J 



lstisliiwlagffalfqsf: 
lstisliiwlagffalb'qsy. 
l^t i s bv i wlag f falfqs . 

lIt I S BV I WLAGF FALFQS 
L§T I siyi WLAGFFALFQS: 
LSI PNLYVWLLMFVGLFHi 
LSIPTLYWWLAMFYTLF1 
L S L PVL YV W LC L F YC L FHLWLN I VAE L LFflFlt DRE f YlKfct 
LSIPTLYVWLGFFYCF FHLWLN I V AE I L C 1\ IDREI 'Y < It 
L S L PNVY LWLCMFYAF FHLWLS I LAE I LR r f IDREI Y < ft 
LSLPNVYLWLCMFYCLFHLWLN I LAE I LB fl IDRE! Y < II 
' YCF FHLWLN I LAE LLC IL'KE. "i < )V 



LSVPTLYVWLCMF' 
L S VPTL YVWl1mF1C| 

lsvpnlyvwlcm: 
lsvpniyvwlcmf 
lsvpnlyvwlcmf 
lsvpnlyvwlcm: 
lsvpnlyvwlcmfyc; 
lsvpnlyvwlcmfycffl 
lsvpnlyvwlcmfycff 
lsvpnlyvwlcmfycff 
lsvpnlyvwlcmfycf fhlw 
lsvpnlyvwlcmfysff 
lsvpnlyvwlcmfyc lf 
lsvpnlyvwlcmf 
lsvpnlyvwlcmf' 
lsvpnlyvwlcmfyc 
lsvpnvyvwJmfIcI 
lsvpnlyvwlcmfycf: 
lsvpnlyvwlcmfyc: 



:Fj I1LWLNIVALLLC .'; IL'.KE. -v < )■: 
'YCF FHLWLN I LAE LLC r f IDPEI Y i ft 
'YCF FHLWLN I LAE LLC r f IDREI Y <. ft 
'YCF FHLWLN I LAE LLC ?t IDREI Y < II 
'YCF FHLWLN I LAE LLC r f IDRElJ Yp.f\ 
:FFHLWLNILAELLC J'( IDREI 



, hlwlnila^li j '::. , ;;l':-e. . < 
'hlhln i lae llc j'( idre. 

ILNILAELLC r ( IDREtYfcty 
'HLHLNILAELLB ?l IDRE. 
'HLWLNILAELLC r f IDRE^Y^fl 
YCF FHLWLN I LAE L L B J'( IDBE: 
'YCF FHLWLN I LAE L LB 7 f iDPElt'Ykfct 
:FFHLHLNILAELLB J'( IDREfYfipi 
F FHLWLN I LAE L L B J'( IDBE: 

'HLWLN I LAE L LB r f IDREfY^fl 

:ffhlwlnilaellb j'i idre: 
lsvpnlyvwlcmfycf fhlwln i lae llp ?< idref y|<jft 
lsvpnlyvwlcli-'ycf fhlwln i v ae l l b :'( idre: 

L S VPNL YV W LCM F YC F FHLWLN I LAE LLC r f IDREltYtefct 
L S VPNL Y V W LCM F YC F FHLWLN I LAE LLP J'< IDBE I 'Y < It 



.EVTB J'dDRQ. -v L )V ■ 
IAELLB Ti IDBE. "i < IV t 



Consensus(625) _:TvF:--l_,-.-VftL-'.: .'i _■ _■■.!_. ■','_[■. _ _r 



HLWLNILAELLC ILR E. ■ •: t d .AK:=VGL' Y'.-.RMWMMPVHKW 



CV L IPWQM YSSRQ STI^ 



L FPLVEL YQ ;> 
F KPML R SS 
F KPMLRR SS 
F KPMLRR SS 
F KPMLRR SS 



F KPMLRR SS 



.HWMWLI 
.HWMVVMI 



L 1 KBL 
F KPML 
L Km 



A. :AL L 
LKAL'R 



rif.ASi 



BGAGK L L £ Q- 3 A \ I: 
IRGAN R L AAQ I A\ F 
R YTRMQGQVA*, F ILSAI 
FSKNMGYFMC F ~VSAI 
YSKNAGYFMt F ^VSAI 
TWNKPVYHFMRRHIFSPLVGR WSPFA|SVM\ F ^SGIllH 
HyHfIr IFSPLIGR flSPFAASVM\ F ^SGII H 
YHFMKPJ IFSPLVGR rtSPFAASLV\ F n FSAVI H 
HVYVPMIGR WSPWA SC. 
VI MIGR /JSPWA 
INKPLRRARFPRI V TS' 



L : ^_ 
E ; SI 
LVRHVYFPC] 



.33FGE 
lTT /GE 
WNAQTVEEYWRMWI 
WNAKSVDEYWRLW1 
WNAKT I DEYWRKW1 
/JNAKTIDEY' RK NM 
WNAKTVEEYWRMWNMPVH1 
rtNAKTVEE Y WRMWNM PVH] 
WNAQT I EEYWRLWNMPVH1 
WNAQT I EEYWRMWNMPVHKWMV 
WNAKTVEE Y WRMWNM P VHKWI 
WNARTVEEYWRMWNMPVHKWMV 
WNAKT IDE Y WRMWNM P VHK WM V 




-itw:jk 

IsWNRl 

stwnrpvytyfk 
^twnrpvyty fkrh 
IkwnlpiIhfli 
Slwnqi 



L 



f . AK 3 V G L) Y RMH : IM_ VilKV, 
rif . AK3 V G D Y WRMWNM PVHKWI 
rtl-; AK 3 VGD YWRMWNMPVHKWI 
WNAKT VAE YW KMWNM PVHRWI 
WN AKT VE EY WRMWNM P VHK WM V 
WN AKT VE D Y WRMWNM P VHKWM I 
WN AKT VE D Y WRMW NM PVHKWI 
WN AKT FE E Y WRMWN M ? VHK W 
WN AQ T VE EY WRMWNM PVHKWI 
WNAKTVEEYWRMWNMPVHKWJ 
WN ART VE EY WRMWNM PVHKWM V 
WNARTVEEYWRMWNMPVHKWMV 
WNAKTVEE Y WRMWNM ? VHK W V 
WNARTVEEYWRMWNMPVHKWMV I FPC NKIR 
WNARTVEEYWRMWNMPVHKWMV I FPC "NKIR 
ftWWNA V YWR WNM PVHKWM RH Y P LR G 



GIT\ F 
WAARTA\ F ASAFE 
WAARTA\ F jASAFE 
WAARTG^ F ASAFE 
WV RMG\ F jASAFE 
WV RTG\ F 
R1G\ F 
WMARTG\ F jASAFE 
F KPMLRR SSRWMARTG 1 , F ASAFE 
F KPMLRR SSKWMARTG\ F jASAFE 
F KPML H SSKWV RTG\ F jTSAFE 
:WMARTG\ F JASAFE 



B NSRW\ 



F KPALRM VSKWMAKTC^FJjASAFE 



L I 'CM 
FPC I 

E : CI 

FPC I 
FPC 



::i<: 

FPC 



E : C 

FPC 
I PC 
FPC 
_ FPC 
FPC 
_ E : C 
L YPC 
_j FPC 
FPC 
L FPC 
FPC 
E . C 
FPC 



V3AVI II 
^VSAVI H 
JSALI H 
^VSAVIH 
A LSKKAAVLL^FAISALFH 



RTG\ F jASAFE 



LHKQF ILV\ 
IS] E\ VLI; ■ 
ISKEVAAFIA F 
FSRGV ILVE F .VSAVI b 
K FSRGV ILI£ F ,V3AVEi: 

:gaailvaf 'msave|h 
fpc ngmpkelailiaf 
fpc "ngmprgg ilia f 
ip iv vlipf 
ip gv ilia f 
skip tlaiii?f 
ikip vp iiiaf 

_l:_l : vl : IIIAE 

IKIF VF IIIAE 

_L : EG 1 1 lip 

_L : GV FVI; 

_L : AV LLIAlE 

LF AA LLIAE 

_L : GV ILIA E 



GVSVVIi E 

hkiprgvallia|f 

HKIPRG^ 
RRKIPRG 1 



LVAVL 
L'3AVE 
I SAFE 



I3G1EI, 
,V3AVI li 
V3AVJ3I, 



I3AIEII 
ISAIEH 
,VSAIE H 
V3AVEII 
,V3AVE H 
V3AVE li 
V 3 AVE II 
,V3AVE I! 
V3GAE li 
,V3AVE I! 
VSALE 11 
.V3ALEI! 
V3ALEE 



IP GAAALTAjF jVSAVFH 1LCIAVP 
iISAIEH 1LCIAVP 



'V3AVE li 
'"V3AVE li 

VSAVEH 
,V3AVE I! 

VSAVI li 
3A Lt~ 



721 

ylvsv: 

YLVSVP 
YMVSVP 
YLVSII 
YLVSII 
YLVSII 
YLVSII 
YlIsiP 
YLVSII 
YLVSVP 
YLVSVP 
YLVSVP 
YLVSVP 
YLVSII 
YLVSII 
YLVSVP 
lYLVSVP 
YLVSVP 
Yl|siP 
LVISIP 
LVISVI 
LLlGIP 

llIgip 

VLVGII 
IVLVGVP 
.VLVGVP 
YMITVI 
.VLVGVP 
VIIGVI 
ICIAVI 
ICVAVP 
LCVAVP 
ICIA' ■ 
ICIAVI 
LCI A' ■ 
LCI A' ■ 
LCI AVI 
LCVAVP 
LCI AVI 
LCI A' 
LCI AVI 
LCI AVI 
LCI A' ■ 
LCI AVI 
_A:i A 
LCI AVI 
LCI A' ■ 
LCI AVI 



LCIAVP 
LCI A 
LCIAVI 
LCI A' ■ 
LCIAVI 
V VP 



Figure 4 Identification of completely conserved amino acid residues in sequence motifs 4-5 of DGATIs (A) DGAT1 -Motif 4 (QP Block), (B) 
DGAT1 -Motif 5 (LWLFFEFDRFYWWNWWNPPFSHP Block; The first boxed F residues are also conserved in DGAT2s; The boxed Y, W and the 
second F residues are mutated in TmDGATI and the boxed H residue is mutated in MmDGATI). Refer to Figure 3 legend for additional 
information. 



high isoelectric points (Table 2). The frequency of func- 
tional amino acid residue groups between DGAT1 and 
DGAT2 subfamilies is also very similar in terms of 
charged residues, acidic residues, basic residues, polar resi- 
dues and hydrophobic residues (Table 2). The remarkable 
feature of DGATIs and DGAT2s is that both subfamilies 
of proteins contain more than 40% of hydrophobic resi- 
dues (Table 2). These high amounts of hydrophobic resi- 
dues in DGATs are in agreement with them being integral 
membrane proteins [33,34] with multiple transmembrane 
domains [13,33,34], localized to endoplasmic reticulum of 
plant and animal cells [13,34], and associated with mito- 
chondria in COS-7 cells [35,42] and lipid bodies in 3T3- 
Ll adipocytes [42]. The membrane association of the pro- 
teins presents extra huddle to purification of recombinant 
DGATs from any source [43,44]. 

Catalytic and regulatory domains of DGATs 

Generally speaking, critical amino acid residues of 
proteins are conserved during the evolution because 



they are essential for enzymatic activity. The con- 
served amino acid residues are clustered at the active 
centers of the enzymes. Multiple sequence alignment 
has shown that DGATIs and DGAT2s have 41 and 16 
completely conserved amino acid residues, respec- 
tively. Most of them are located at the carboxyl ter- 
mini of DGATs (Table 3). This sequence analysis 
suggests that the catalytic domains of DGATs are 
located at the carboxyl termini of the proteins. This is 
supported by mutations of some completely conserved 
amino acid residues in the C-termini of these proteins 
resulted in complete loss of the enzymatic activity of 
DGATs (see below). This suggestion is in line with 
our previous assignment of the catalytic domains of 
ADPGlc-dependent a-l,4-glucosyltransferases and a- 
1,6-glucan hydrolases from plants and prokaryotes at 
the carboxyl termini of the enzymes because of the 
presence of the conserved amino acid residues and 
sequence motifs in the different isoforms from diverse 
organisms [6,9]. 
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Figure 5 Identification of completely conserved amino acid residues in sequence motifs 6-7 of DGATIs. (A) DGAT1 -Motif 6 (FQL Block), 
(B) DGAT1 -Motif 7 (NGQPY Block). Refer to Figure 3 legend for additional information. 



Several lines of evidence suggest that the regulatory 
domains of DGATs are located at the amino termini of 
the proteins. First, a recent study showed that the 
amino terminal domain of DGAT1 of mouse is not 
required for the catalytic activity of DGAT1 but may be 
involved in regulating enzyme activity and dimer/tetra- 
mer formation [45]. Second, the N-terminal region of 
mouse DGAT2 or yeast DGAT2 is not essential for 



DGAT activity in vitro [33,35]. Finally, mutagenesis of a 
putative protein kinase SnRKl (SNFl-related kinase 1) 
target site at S197 to alanine in TmDGATl results in a 
3896-80% increase in DGAT1 activity, and over-expres- 
sion of the mutated TmDGATl in Arabidopsis results 
in a 2096-50% increase in oil content on a per seed basis 
[25]. This serine residue is conserved in most of the 
plants and located at the N-termini of DGATIs (Figure 
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PtDGAT2-XP_527842. 2(184] 
DdDGAT2-XP_635762. 1(174] 
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NvDGAT2-XP_001635548. 1(157] 
TgDGAT2-XP_002187643. 1(179] 
MdDGAT2-XP_001365685. 1(179] 
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CeDGAT2-NP_872180. 1(183] 
TcDGAT2-XP_975146. 1(184] 
AgDGAT2-NP_983542. 1(305] 
ScDGAT2-NP_014888. 1(262] 
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CrDGAT2-XP_00 1694904. 1(178] 
CrDGAT2-XP_001693189. 1(175] 
OtDGAT2-XP_003083539. 1(175] 
PpDGAT2-XP_001758758. 1(168] 
PpDGAT2-XP_001777726. 1(164] 
SmDGAT2-XP_002972054. 1(195] 
PsDGAT2-ABK26256. 1(183] 
SpDGAT2-AAQ89590. 1(177] 
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Figure 6 Identification of completely conserved amino acid residues in sequence motifs 1-3 of DGAT2s (A) DGAT2-Motif 1 (PH Block; 
The boxed H, P and H residues are mutated in MmDGAT2 and ScDGAT2), (B) DGAT2-Motif 2 (PR Block; The boxed P residues are also conserved 
in DGATIs), (C) DGAT2-Motif 3 (GGE Block). Multiple sequence alignment was performed using 54 full-length DGAT2 protein sequences from 44 
organisms (listed in Table 1). The completely conserved amino acid residues are highlighted in red on yellow. Other color code and related 
information are described briefly in Figure 2 legend and with details in "Methods" section. 



9A). All of the above mentioned sequence analysis and 
experimental evidence support the concept that the cat- 
alytic and regulatory domains of DGATs are located at 
the C- and N-termini of the enzymes, respectively. 

Functional significance of less conserved motifs in 
DGAT2s 

Recent studies have reported functional motifs in 
DGATs including putative neutral lipid-binding domain 
(FLXLXXX n in mouse DGAT2) [34], mitochondrial tar- 
geting signal (RXKXXK in mouse DGAT2) [35] and ER 
retrieval motif (LKLEI in tung DGAT2) [13]. However, 
the conserved sequence motifs identified by our exten- 
sive sequence analysis do not contain any of these 
reported motifs. In our analysis, the putative neutral 
lipid-binding domain [34] which was shown to be pre- 
sented in proteins that either bind to or metabolize neu- 
tral lipids [36], is only modestly conserved in animal 
DGAT2s and not present in any plant DGAT2 (Figure 



8A). Similarly, the putative mitochondrial targeting sig- 
nal is only found in a few animal DGAT2s but not con- 
served in any plant or fungi DGAT2 (Figure 8B). This 
sequence analysis also shows that the pentapeptide 
(LKLEI) ER-retrieval motif identified at the extreme car- 
boxyl terminus of tung DGAT2 [13] is only modestly 
conserved in plant DGAT2s but not in animal or fungus 
DGAT2s (Figure 8C). All these studies point out that 
less conserved regions in a subset of DGATs may play 
specific roles in TAG biosynthesis in that particular sub- 
set of organisms. 

Functional significance of the completely conserved 
residues 

Multiple sequence alignment has shown that 55 
DGATIs and 54 DGAT2s have 41 and 16 completely 
conserved amino acid residues, respectively, although 
only two residues are completely conserved among all 
DGATs (Table 3). It is likely that these completely 
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Figure 7 Identification of completely conserved amino acid residues in sequence motifs 4-6 of DGAT2S (A) DGAT2-Motif 4 (RGFA Block), 
(B) DGAT2-Motif 5 (VPFG Block; The boxed F residues are also conserved in DGATIs), (C) DGAT2-Motif 6 (G Block). Refer to Figure 6 legend for 
additional information. 



conserved amino acid residues are critical for DGAT 
enzymatic activities. These residues may be involved in 
substrate binding, direct catalysis, and/or maintenance 
of protein structure including oligomer formation. The 
importance of some conserved residues in DGATIs has 
been demonstrated by site-directed mutagenesis (Table 
4). Mutagenesis at H426 in mouse DGAT1 to alanine 
impairs the ability of DGAT1 to synthesize triacylglycer- 
ols, retinyl and wax esters in an "in vitro" acyltransferase 
assay [45]. This histidine residue is completely con- 
served in Motif 5 of all DGATIs (Figure 4B). Similarly, 
mutagenesis at Y392, W395 and F439 in Tropaeolum 
majus DGAT1 eliminates nearly all activity [25]. These 
three residues are also completely conserved in Motif 5 
of all DGATIs (Figure 4B). All four residues are located 



in the most conserved region of DGATIs in which 23 
completely conserved residues are located in Motif 5 of 
the multiple sequence alignment (Figure 4B). 

The importance of the completely conserved residues 
in DGAT2s is also supported by site-directed mutagen- 
esis. Mutagenesis at H161, P162 and H163 sites, and the 
triple mutant in mouse DGAT2 results in a substantial 
loss of activity (Table 4) [34]. Mutation at the corre- 
sponding sites at H193 and H195 in DGAT2 of baker's 
yeast results in complete loss of the activity (Table 4) 
[33]. These three resides are located in the highly con- 
served Motif 1 (PH Block) of DGAT2s (Figure 6A). 
These results suggest that they may be located at the 
active center of DGATIs, but the precise roles of these 
residues either involved in substrate binding or catalysis 
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Figure 8 Sequence analysis of important motifs in less conservative regions of DGAT2S. Multiple sequence alignment was performed 
using 54 full-length DGAT2 protein sequences from 44 organisms (listed in Table 1). Color code and related information are described briefly in 
Figure 2 legend and with details in "Methods" section. The motifs are boxed within the sequence alignment. (A) Putative neutral lipid-binding 
domain (FLXLXXX in mouse DGAT2) (B) Mitochondrial targeting signal (RXKXXK in mouse DGAT2), (C) ER retrieval motif (LKLEI in tung DGAT2). 



are not clear. Further experiments are required to assess 
the contribution of the other completely conserved resi- 
dues to the enzymatic activity of DGATs. 

Functional significance of the less-well conserved 
residues in site-directed mutants 

The importance of some less conserved residues in 
DGATs has also been demonstrated by site-directed 
mutagenesis (Table 4). As described above, mutation at 



S197 (a putative SnRKl target site) in TmDGATl 
results in a 3896-80% increase in DGAT1 activity. This 
serine residue is conserved in most of the plants (Figure 
9A). In addition, mutagenesis at E145 in Motif 1 of Tro- 
paeolum majus DGAT1 results in the loss of almost 
half of the activity [25]. This glutamate residue is con- 
served in all plant DGATls and most other DGATls 
except bird, chimpanzee, Dictyostelium discoideum, 
Polysphondylium pallidum and Metarhizium acridum 
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Table 4 Site-directed and natural mutants of DGATs and their effects on enzymatic activity and TAG accumulation. 



DGAT 



Amino acid sequence with altered residues underlined 



Mutation 



Activity 



Reference 



MmDGATI- 1 MGDRGGAGSS RRRRTGSRVS VQGGSGPKVE EDEVRDAAVS 

NP_034176.1 PDLGAGGDAP APAPAPAHTR DKDGRTSVGD 

[Mus musculus] 71 GYWDLRCHRL QDSLFSSDSG FSNYRGILNW CWMLILSNA 

RLFLENLIKY GILVDPIQW SLFLKDPYSW 

141 PAPCVIIASN IFWAAFQIE KRLAVGALTE QMGLLLHWN 
LATIICFPAA VALLVESITP VGSVFALASY 

211 SIMFLKLYSY RDVNLWCRQR RVKAKAVS TG KKVSGAAAQQ 
AVSYPDNLTY RDLYYFIFAP TLCYELNFPR 

281 SPRIRKRFLL RRVLEMLFFT QLQVGLIQQW MVPTIQNSMK 
PFKDMDYSRI IERLLKLAVP NHLIWLIFFY 

3 51 WFFHSCLNAV AELLQFGDRE FYRDWWNAES VTYFWQNWNI 
PVHKWCIRHF YKPMLRHGSS KWVARTGVFL 

421 TSAFFHEYLV SVPLRMFRLW AFTAMMAQVP LAWIVGRFFQ 
GNYGNAAVWV TLIIGQPVAV LMYVHDYYVL 

4 91 NYDAPVGV 



f'15] 



TmDGATI- 
AAM03340.2 
[Tropaeolum majus] 



1 MAVAESSQNT TTMSGHGDSD LNNFRRRKPS SSVIEPSSSG 
FTSTNGVPAT GHVAENRDQD RVGAMENATG 
71 SVNLIGNGGG WIGNEEKQV GETDIRFTYR PSFPAHRRVR 
ESPLSSDAIF KQSHAGLFNL CIWLIAVNS 

141 RLIIENLMKY GWLIDTGFWF SSRSLGDWSI FMCCLTLPIF 
PLAAFIVEKL VQRNHlSELV AVLLHVIVST 

211 AAVLYPVIVI LTCDSVYMSG WLMLFGCIM WLKLVSYAHT 
SSDIRTLAKS GYKGDAHPNS TIVSCSYDVS 

281 LKSLAYFMVA PTLCYQPSYP RSSCIRKGWV VRQFVKLIVF 
IGLMGFIIEQ YINPIVRNSK HPLKGDFLYA 

351 IERVLKLSVP NLYVWLCMFY SFFHLWLNIL AELLRFGDRE 
FYKDWWNAKT VAEYWKMWNM PVHRWMVRHL 

421 YFPCLRNGIP KEGAIIIAFL VSGAFHELCI AVPCHVFKLW 
AFIGIMFQVP LVLITNYLQE KFSNSMVGNM 
491 IFWFIFCILG QPMCVLLYYH DLINLKEK 



E145V 
S197A 
P216R 
Y392A 
Y392A/ 
W395G 
F439R 



-43% 
h38-80% 
-100% 

-80% 
-100% 
-100% 



[25] 



MmDGAT2- 


1 MKTLIAAYSG VLRGERRAEA ARSENKNKGS ALSREGSGRW 


F8 0A 


-66% 


[34] 


NP_080660.1 


GTGSSILSAL QDIFSVTWLN RSKVEKQLQV 


L81A 


-85% 




[Mus musculus] 


71 ISVLQWVLSF LVLGVACSVI LMYTFCTDCW LIAVLYFTWL 


L83A 


-100% 






AFDWNTPKKG GRRSQWVRNW AVWRYFRDYF 


H161A 


-50-60% 






141 PIQLVKTHNL LTTRNYIFGY HPHGIMGLGA FCNFSTEATE 


P162G 


-50-60% 






VSKKFPGIRP YLATLAGNFR MPVLREYLMS 


H163A 


< 20% 






211 GGICPVNRDT IDYLLSKNGS GNAIIIWGG AAESLSSMPG 


H161A/ 


< 20% 






KNAVTLKNRK GFVKLALRHG ADLVPTYSFG 


P162G/ 








281 ENEVYKQVIF EEGSWGRWVQ KKFQKYIGFA PCIFHGRGLF 


H163A 








SSDTWGLVPY SKPITTWGE PITVPKLEHP 










351 TQKDIDLYHA MYMEALVKLF DNHKTKFGLP ETEVLEVN 








ScDGAT2 (Dgalp)- 


1 MSGTFNDIRR RKKEEGSPTA GITERHENKS LSSIDKREQT 


CC48/49AA 


-30% 


[33,37] 


NP_014888.1 


LKPQLESCCP latpferrlq tlavawhtss 


C12 7A 


-40% 




[Saccharomyces 


71 FvLfsiftlf aistpalwvl aipymiyfff drspatgew 


C183A 


-10% 




cerevisiae] 


NRYSLRFRSL PIWKWYCDYF PISLIKTVNL 


C211S 


-10% 






141 KPTFTLSKNK RVNEKNYKIR LWPTKYSINL KSNSTIDYRN 


C2S4A 


-20% 






QECTGPTYLF GYHPHGIGAL gafgafateg 


C314A 


-15% 






211 Cnyskifpgi pislmtlvtq fhiplyrdyl lalgissvsr 


F71A 


-60% 






KNALRTLSKN QSlClWGGA RESLLSSTNG 


L73A 


-40% 






2 81 TQLILNKRKG FIKLAIQTGN INLVPVFAFG EVDCYNVLST 


Y12 9A/ 


-100% 






KKDSVLGKMQ LWFKENFGFT IPIFYARGLF 


F130A/ 


-100% 






351 NYDFGLLPFR APINVWGRP IYVEKKITNP PDDWNHFHD 


P131A 


-100% 






LYIAELKRLY YENREKYGVP DAELKIVG 


H193A 
H195A 







ZmDGAT1-2- 1 MAPPPSMPAA SDRAGPGRDA GDSSSLRLRR APSADAGDLA 

EU039830 GDSSGGLREN GEPQSPTNPP PQEQQQHEML 

[Zea mays] 71 YYRASAPAHR RVKESPLSSD AIFRQSHAGL LNLCIWLIA 

VNSRLIIENL MKYGLLIRAG FWFSARSLGD 

141 WPLLMCCLTL PVFPLVALMA EKLITRKLIG EHWILLHII 
ITTSAIVYPV WTLKCDSAV LSGFVLMFLA 

211 SIMWMKLVSY AHTNYDIRVL SKSTEKGAAY GNYVDPENMK 
DPTFKSLVYF MLAPTLCYQP TYPQTTCIRK 

281 GWVTQQLIKC WFTGLMGFI IEQYINPIVK NSKHPLKGNF 
LNAIERVLKL SVPTLYVWLC MFYCFFHLWL 

351 NIVAELLCFG DREFYKDWWN AKTVEEYWRM WNMPVHKWII 
RHIYFPCIRK GFSRGVAILI SFLVSAVFHE 

421 ICIAVPCHIF KFWAFSGIMF QIPLVFLTRY LHATFKHVMV 
GNMIFWFFFS IVGQPMCVLL YYHDVMNRQA 
4 91 QASR 



F4 6 9 
insertion 



h41-107% oil 
contents 



[38] 
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Table 4 Site-directed and natural mutants of DGATs and their effects on enzymatic activity and TAG accumulation. 

(Continued) 

BtDGATI- 1 MGDRGGAGGS RRRRTGSRPS IQGGSGPAAA EEEVRDVGAG K232A Devoid of milk [41] 

AAL49962.1 GDAPVRDTDK DGDVDVGSGH WDLRCHRLQD secretion 

[Bos taurus] 71 SLFSSDSGFS NYRGILNWCV VMLILSNARL FLENLIKYGI 

LVDPIQWSL FLKDPYSWPA LCLVIVANIF 

141 AVAAFQVEKR LAVGALTEQA GLLLHGVNLA TILCFPAAVA 
FLLESITPVG SVLALMVYTI LFLKLFSYRD 

211 VNLWCRERRA GAKAKAALAG KKANGGAAQR TVSYPDNLTY 
RDLYYFLFAP TLCYELNFPR SPRIRKRFLL 

2 81 RRLLEMLFLT QLQVGLIQQW MVPAIQNSMK PFKDMDYSRI 
VERLLKLAVP NHL IWL IFFY WLFHSCLNAV 

351 AELMQFGDRE FYRDWWNSES ITYFWQNWNI PVHKWCIRHF 
YKPMLRRGSS KWAARTAVFL ASAFFHEYLV 

421 SIPLRMFRLW AFTGMMAQIP LAWIVGRFFR GNYGNAAVWL 
SLIIGQPVAV LMYVHDYYVL NREAPAAGT 



(Figure 3A). Mutagenesis at P216 in Tropaeolum majus 
DGAT1 eliminates almost all of the activity [25]. P216 
is completely conserved in plant DGATls but is missing 
in all mammalian DGATls (Figure 9B). Mutation at 
Y129/F130/P131 in DGAT2 of baker's yeast results in a 
complete loss of the activity [33]. These three residues 
are highly conserved but none of them is completely 
conserved among all DGAT2s in our analysis using 54 
full-length DGATs (Figure 10A). Mutations at F80/L81/ 
L83 in mouse DGAT2 [34] and F71/L73 in baker's yeast 
DGAT2 [33] result in partial loss of the DGAT activity 
(Figure 10B). Finally, ScDGAT2 has a unique cysteine 
residue (C314) which is not involved in catalysis but 
may be located near the active site or related to proper 
folding of the protein [37]. However, this residue is only 
found in DGAT2s from baker's yeast and the other two 
fungi Ashbya gossypii and Physcomitrella patens, but is 
not present in any of the other 51 DGAT2s or any of 
the 55 DGATls analyzed (Figure IOC). Nonetheless, 
site-directed mutagenesis indicates that these less con- 
served residues, although not essential, contribute to the 
full activity of DGATs. 

Functional significance of the relatively conserved 
residues in natural mutants 

Two well-known natural mutants in corn and cattle 
demonstrate the importance of some relatively con- 
served residues in TAG biosynthesis (Table 4). Genetic 
mapping has identified a high-oil QTL (qH06) that 
affects maize seed oil and oleic-acid contents associated 
with DGAT1-2 [38]. A phenylalanine insertion (F469) in 
DGAT1-2 is responsible for the increased oil and oleic- 
acid contents. Ectopic expression of the high-oil 
DGAT 1-2 allele increases oil and oleic-acid contents by 
up to 41% and 107%, respectively [38]. This phenylala- 
nine residue is conserved in all plants except Brassica 
napus (rape, AAD45536.1) and conserved in all fungi 
except mold (Dictyostelium discoideum and Polysphon- 
dylium pallidum) (Figure 11A). It is not present in any 



of the animal DGATls or any of DGAT2s. This case 
suggests that oil content can be potentially improved in 
transgenic plants by introducing site-specific amino acid 
substitutions/changes in DGATs. 

DGAT1 knockout mice are completely devoid of milk 
secretion, most likely because of deficient triglyceride 
synthesis in the mammary gland [18]. DGAT1 
sequences from pooled DNA show significant frequency 
shifts at several residue positions between groups of ani- 
mals with high and low breeding values for milk fat 
content in different breeds [41]. Substitution of lysine by 
alanine (K232A) is directly responsible for the QTL var- 
iation with the lysine-encoding allele being associated 
with higher milk fat content [41]. Both DGAT1 alleles 
are expressed in Sf9 cells, an insect expression system, 
and characterized the expressed proteins. The K allele, 
causing an increase in milk fat percentage in the live 
animal, is characterized by a higher Vmax in producing 
triglycerides than the A allele [46]. This lysine residue is 
conserved in mammals but not in plants, fungi or other 
animals except one of the two forms from dog and zeb- 
rafish (Figure 11B). This case also suggests that lipid 
content can be improved in transgenic animals by 
bioengineering specific amino acid residues of DGATs. 

Conclusions 

Understanding the precise roles of DGATs may help to 
create transgenic plants with value-added properties and 
provide information for therapeutic intervention for 
obesity and related diseases because DGATs catalyze the 
final and rate-limiting step of TAG biosynthesis in 
eukaryotic organisms. This report analyzed 117 DGAT 
sequences from 70 organisms ranging from plants, ani- 
mals and fungi to aid our understanding of the struc- 
ture-function relationship of these important enzymes. 
The report identified conserved sequence motifs and 
amino acid residues in all 117 DGATs and DGAT1 and 
DGAT2 subfamilies, reassigned some DGAT subfamily 
members based on the phylogenetic analysis and 
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Figure 9 Sequence analysis of important amino acid residues in less conservative regions of DGATIs. (A) The boxed S residue is 
mutated in Tropaeolum majus DGAT1 corresponding to S197, (B) The boxed P residue is mutated in Tropaeolum majus DGAT1 corresponding to 
P216. Multiple sequence alignment was performed using 55 full-length DGAT1 protein sequences from 45 organisms (listed in Table 1). Color 
code and related information are described briefly in Figure 2 legend and with details in "Methods" section. The amino acid residues studied by 
mutagenesis and the corresponding conserved residues in other organisms are boxed within the sequence alignment. 



sequence similarities, and discussed the importance of (41 residues) and DGAT2s (16 residues) in the multiple 



some conserved residues with site-directed and natural 
mutants. One interesting observation is that the newly 
reported DGAT3 and DGAT4 sequences do not contain 
any of the completely conserved residues in DGATIs 



sequences alignment. This sequence divergence is in 
contrast to the general belief that the active sites of 
enzymes should be conserved during the evolution 
because all catalyze the same/similar biochemical 
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Figure 10 Sequence analysis of important amino acid residues in less conservative regions of DGAT2s (A) YFP motif (The boxed Y, F 
and P residues are mutated in baker's yeast DGAT2 corresponding to Y 1 29/F 1 30/P 131), (B) The boxed F, L and L residues are mutated in mouse 
DGAT2 corresponding to F80, L81 and L83 and in baker's yeast DGAT2 corresponding to F71 and L73, (Q The boxed C residue is mutated in 
baker's yeast DGAT2 corresponding to C314. Multiple sequence alignment was performed using 54 full-length DGAT2 protein sequences from 44 
organisms (listed in Table 1). Color code and related information are described briefly in Figure 2 legend and with details in "Methods" section. 
The amino acid residues studied by mutagenesis and the corresponding conserved residues in other organisms are boxed within the sequence 
alignment. 



reaction. Therefore, this sequence divergence raises an 
important question how proteins with completely differ- 
ent amino acid sequences could perform the same bio- 
chemical reaction, although some variations of the 
conserved sequence motifs and amino acid residues are 
expected when more sequences of DGATs are used in 
the multiple sequence alignment. 

It has been well-documented that many of the 
enzymes in the oil biosynthesis pathway are not stable. 
Although the precise reasons are unknown, it is possi- 
ble that plants develop a feedback mechanism to regu- 
late the optimal amount of enzymes so that the 
biophysical properties of ER membranes are 



functionally intact without dramatic alterations by 
over-expressed enzymes in the host. If this is the case, 
it may be advantageous to introduce genes with low 
copy numbers but with high catalytic efficiency. This 
concept is supported by three studies with plants 
(S197 in Arabidopsis and F469 in corn) and animals 
(K232 in cattle) which demonstrate the potential to 
increase oil/fat production by altering a single amino 
acid residue of DGAT1. Therefore, the sequence analy- 
sis should facilitate studying the structure-function 
relationship of DGATs with the ultimate goal of iden- 
tifying critical amino acid residues. This will guide the 
construction of superb enzymes for metabolic 
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Figure 11 Sequence analysis of important amino acid residues of DGATIs shown in natural mutants. Multiple sequence alignment was 
performed using 55 full-length DGAT1 protein sequences from 45 organisms (listed in Table 1). The completely conserved amino acid residues 
are highlighted in red on yellow. Other color code and related information are described briefly in Figure 2 legend and with details in "Methods" 
section. The amino acid residues affected by natural mutation and the corresponding conserved residues in other organisms are boxed within 
the sequence alignment. (A) maize DGAT1-2 F468, (B) cattle DGAT1 K232A. 



engineering and rational design of DGAT inhibitors to 
be used for obesity and related diseases. 

Methods 

Database search of DGATs 

DGAT sequences were obtained from Blastp search 
[47,48] using tung tree {Vernicia fordii) DGAT sequences 
[GenBank:DQ356680.1] (DGAT1) and [GenBank: 
DQ356682.1] (DGAT2) [13] against the National Center 
for Biotechnology Information (NCBI)'s non-redundant 



protein sequence databases http://blast.ncbi.nlm.nih.gov/ 
Blast. cgi. Additional DGAT sequences were obtained 
using DGAT1 and DGAT2 search term in NCBI's Protein 
database, http://www.ncbi.nlm.nih.gov/protein. A total of 
109 full-length and 8 near full-length DGATs were 
obtained from 70 organisms including plants (such as Ara- 
bidopsis, barley, caster bean, cauliflower, corn, rape, rice, 
sorghum, soybean, tobacco, tung tree), animals (such as 
bird, chimpanzee, cow, dog, fish, fly, frog, monkey, mos- 
quito, mouse, pig, rabbit, rat, sheep, worm), fungi (such as 
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yeast, mold, moss) and human. The names of DGATs 
used in the analysis and their corresponding organisms, 
classification of DGAT subfamily, and the GenBank acces- 
sion numbers are presented in Table 1. The name of each 
protein sequence consists of the initials of the organism 
followed by the assigned subfamily of DGATs in the data- 
bases and the GenBank accession number. 

Protein analysis 

The properties and amino acid compositions of DGATs 
were analyzed using Vector NTI software (Invitrogen) 
[49]. Statistics was performed using Microsoft Excel. 

Phylogenetic analysis 

Phylogenetic analysis was used to study the presumed 
evolutionary relationships among the 117 DGATs from 
70 organisms. This analysis was performed using the 
Vector NTI software (Invitrogen) based on the Neighbor- 
Joining method of Saitou and Nei [50]. The numbers in 
the parenthesis following DGAT names are the calcu- 
lated distance values which reflect the degree of diver- 
gence between all pairs of DGAT sequences analyzed. 

Multiple sequence alignment 

Multiple sequence alignment was performed using the 
ClustalW algorithm [51,52] of the AlignX program of the 
Vector NTI software. This method is based on algo- 
rithms that assign scores to aligned residues and detect 
sequence similarities. Identical amino acid residues in 
alignment have higher scores than those not identical 
and less similar residues. Each DGAT sequence name is 
on the left of the alignment followed by the position of 
amino acid residue of DGAT protein sequence in the 
alignment. The numbers at the top of the alignment are 
the positions of the multiple sequence alignment. The 
letters at the bottom of the alignment are the consensus 
residues. Color codes for amino acid residues are as fol- 
lows: 1) red on yellow: consensus residue derived from a 
completely conserved residue at a given position; 2) black 
on green: consensus residue derived from the occurrence 
of greater than 50% of a single residue at a given position; 
3) blue on cyan: consensus residue derived from a block 
of similar residues at a given position; 4) green on white: 
residue weakly similar to consensus residue at a given 
position; 5) black on white: non-similar residues. 

Lists of abbreviations 

DGAT: diacylglycerol acyltransferase; QTL: quantitative trait loci; TAG: 
triacylglycerol 
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